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Abstract 

In two papers entitled "On a new foundation [Neue Begriindung] of quantum me- 
chanics," Pascual Jordan (1927b, g) presented his version of what came to be known 
as the Dirac-Jordan statistical transformation theory. As an alternative that avoids 
the mathematical difficulties facing the approach of Jordan and Paul A. M. Dirac 
(1927), John von Neumann (1927a) developed the modern Hilbert space formalism 
of quantum mechanics. In this paper, we focus on Jordan and von Neumann. Cen- 
tral to the formalisms of both are expressions for conditional probabilities of finding 
some value for one quantity given the value of another. Beyond that Jordan and 
von Neumann had very different views about the appropriate formulation of prob- 
lems in quantum mechanics. For Jordan, unable to let go of the analogy to classical 
mechanics, the solution of such problems required the identification of sets of canon- 
ically conjugate variables, i.e., p's and g's. For von Neumann, not constrained by 
the analogy to classical mechanics, it required only the identification of a maximal 
set of commuting operators with simultaneous eigenstates. He had no need for p's 
and g's. Jordan and von Neumann also stated the characteristic new rules for prob- 
abilities in quantum mechanics somewhat differently. Jordan (1927b) was the first 
to state those rules in full generality. Von Neumann (1927a) rephrased them and, 
in a subsequent paper (von Neumann, 1927b), sought to derive them from more 
basic considerations. In this paper we reconstruct the central arguments of these 
1927 papers by Jordan and von Neumann and of a paper on Jordan's approach by 
Hilbert, von Neumann, and Nordheim (1928). We highlight those elements in these 
papers that bring out the gradual loosening of the ties between the new quantum 
formalism and classical mechanics. 
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1 Introduction 



1.1 The Dirac- Jordan statistical transformation theory 



On Christmas Eve 1926, Paul A. M. Dirac, on an extended visit to Niels Bohr's 
institute in Copenhagen, wrote to Pascual Jordan, assistant to Max Born in 
GottingenQ 

Dr. Heisenberg has shown me the work you sent him, and as far as I can see 
it is equivalent to my own work in all essential points. The way of obtaining 
the results may be rather different though ... I hope you do not mind the 
fact that I have obtained the same results as you, at (I believe) the same 
time as you. Also, the Royal Society publishes papers more quickly than 
the Zeits. f. Phys., and I think my paper will appear in their January issue. 
I am expecting to go to Gottingen at the beginning of February, and I am 
looking forward very much to meeting you and Prof. Born there (Dirac to 
Jordan, December 24, 1916, AHQP)0 



Dirac's paper, "The physical interpretation of the quantum dynamics" ((Dirac 



19271 ). had been received by the Royal Society on December 2. Zeitschrift fur 
Physik had received Jordan's p aper, "On a n ew foundation [Neue Begriin- 
dung] of quantum mechanics" ( Jordan . 1927b ). on December 18JZI In both 



* This paper was written as part of a joint project in the history of quantum physics 
of the Max Planck Institut fur Wissenschaftsgeschichte and the Fritz- Haber-Institut 
in Berlin. 

* Corresponding author. Address: Tate Laboratory of Physics, 116 Church St. SE, 
Minneapolis, MN 55455, USA, Email: janss011@umn.edu 

1 Reminiscing about the early days of quantum mechanics, Jordan (jl955l ) painted 



an idyllic picture of Gottingen during this period, both of the town itself and of its 
academic life. J. Robert Oppenheimer, who spent a year there in 1926-1927, also 
remembered a darker side: "[AJlthough this society was extremely rich and warm 
and helpful to me, it was parked there in a very very miserable German mood which 
probably in Thuringia was not as horrible as a little bit further east; Gottingen is not 
in Thuringia — not as bad as in Thuringia — but it was close enough to Thuringia 
to be bitter, sullen, and, I would say, discontent and angry and loaded with all 
those ingredients which were later to produce a major disaster" (Interview with 
Oppenheimer for the Archive for History of Quantum Physics [AHQP], session 2, 
p. 5; quoted in part by Bird a nd Sherwin, 2005, pp. 57-58). For detailed references 
to material in the AHQP, see Kuhn et al. (jl967l ). 



2 



This lette r is quoted almost in its entirety fcf . note 42) and with extensive com 



mentary by lMehra and Rechenbergl (|2000-200ll . p. 72, pp. 83-87). 
3 We will refer to this paper as Neue Begriindung I to distinguish it from Neue 
Begriindung II, submitted June 3, 1927 to the same jo urnal, a sequel in which 
Jordan tried both to simplify and generalize his theory ( Jordan . 1927gl ). A short 
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cases it took a month for the paper to get published: Dirac's appeared Jan- 
uary 1, Jordan's January 18 PI What Dirac and Jordan, independently of one 
another, had worked out and presented in these papers has come to be known 
as the Dirac- Jordan (statistical) transformation theory!"^! As Jordan wrote in 
a volume in honor of Dirac's 70th birthday: 

After Schrddinger's beautiful papers [Schrodinger, 1926a], I formulated what 
I like to call the statistical transformation theory of quantum mechanical 
systems, answering generally the question concerning the probability of find- 
ing by measurement of the observable b the eigenvalue b', if a former mea- 
surement of another observable a had given the eigenvalue a'. The same 
answer in the same generality was developed in a wonderful manner by 
Dirac ( iJordanl . Il973l . p. 296; emphasis in original). 



In our paper, we focus on Jordan's version of the theory and discuss Dirac's 
version only to indicate how it differs from Jordan's!"^! 

Exactly one month before Dirac's letter to Jordan, Werner Heisenberg, Bohr's 
assistant in the years 1926-1927, had already warned Jordan that he was 
about to be scooped. In reply to a letter, apparently no longer extant, in 
which Jordan must have given a preview of Neue Begriindung I, he wrote: 

I hope that what's in your paper isn't exactly the same as what's in a paper 
Dirac did here. Dirac's basic idea is that the physical meaning of S a p, given 
in my note on fluctuations, can greatly be generalized, so much so that it 
covers all physical applications of quantum mechanics there have been so 
far, and, according to Dirac,all there ever will be (Heisenberg to Jordan, 
November 24, 1926, AHQP)[Z] 



The "note on fluctuations" is a short paper received by Zeitschrif t fur Phusik 
on Noy ember 6 but not published until December 20. In this note, iHeisenberg 
( 1l926bl ). drawing on the technical apparatus of an earlier paper on resonance 



version of Neue Begriindung I was presented to the Gottingen Acad emy by Born on 
behalf of Jordan in the session of January 14, 1927 (jJordanl. Il927d) . 

4 The extensive bibliography of lMehra and Rechenbera ( 2000-200 ll ) gives the dates 
papers were received and published for all the primary literature it lists. 

5 The term 'transformation theory' — and even the term 'statistical transformation 
theory' — is sometimes used more broadly (see cf. note 50). 



For discussions of Dirac's vers ion, see, e.g.. iJammerl (119661. pp. 302-305). iKragh 



7 2-89), and iRechenberg 



20ld 543-54 8). 



(jl99(l pp. 3 9-43) .lDarrigoll(ll992l . pp. 337-345) JMehr a and Rechenberel (|2000-2001 



7 iMehra and Rechenbergl ( 2000-20011 . p. 72) quote more extensively from this let- 
ter. They also quote and discuss (ibid., p. 78) a s imilar rema rk in a letter from 
Heisenberg to Wolfgang Pauli, November 23, 1926 (|Paulil . Il979l . Doc. 148, p. 357). 
In this letter, Heisenberg talks about Dirac's "extraordinary grandiose [grofiziigige] 
generalization" (quoted by Kragh, 1990, p. 39). 
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phenomena (jHeisenbergl . Il926al ) , analyzed a simple system of a pair of iden- 
tical two-state atoms perturbed by a small interaction to allow for the flow- 
ing back and forth of the amount of energy separating the two states of the 
atomslf] The S a ^s are the elements of a matrix S implementing a canonical 
transformation, /' = S~~ 1 fS, from a quantum-mechanical quantity / for the 
unperturbed atom to the corresponding quantity /' for the perturbed one. 
The discrete two- valued indices a and /3 label the perturbed and unperturbed 
states, respectively. Heisenberg proposed the following interpretation of these 
matrix elements: 



If the perturbed system is in a state a, then \S a /3\ 2 gives the probability that 
(because of collision processes, becau se the perturbation suddenly stops, etc.) 
the system is found to be in state (3 (jHeisenbergl . Il926bl . p. 505; emphasis in 
the original). 



Heisenberg! (Il926bl . pp. 504-505) emphasized that the same analysis applies 
to fluctuations of other quantities (e.g., angular momentum) and to other 
quantum systems with two or more not necessarily identical components as 
long as their spectra all share the same energy gap. 



In the introduction of his paper on transformation theory. iDirad (119271 . p. 622) 
briefly described Heisenberg's proposal, thanked him for sharing it before pub- 
lication, and announced that it is "capable of wide extensions." Dirac, in fact, 
extended Heisenberg's interpretation of S a p to any matrix implementing a 
transformation of some quantum-mechanical quantity g from one matrix rep- 
resentation, written as g(a'a>"), to another, written as g(£'£"). The primes on 
the indices labeling rows and columns distinguish the numerical values of the 
quantities a and £ from those quantities themselves (ibid., p. 625). Depending 
on the spectrum of the associated quantities, these indices thus take on purely 
discrete values, purely continuous ones, or a mix of both. Dirac wrote all equa- 
tions in his paper as if the indices only take on continuous values (ibid.). He 
introduced the compact notation (£'/«') for the transformation matrix from 
the a-basis to the £-basis (ibid., p. 630). In the spirit of Heisenberg's proposal, 
/cx')\ 2 d£' is interpreted as the probability that £ has a value between £' and 
£' + d£' given that a has the value a'. Although the notation is Dirac's, this 



8 As iHeisenberd ( 1926bl . p. 501) explaine d in the i ntroduction of his paper, fluctu- 
ations were of interest because, as Albert Einstein (|l905l . llQOQal fbh had shown, they 
provide a tell-tale sign of quantum discontinuities. For discussion of Heisenberg's 
(1926c) note on fluctuations in the context of the debate over Jordan's deriva- 
tion of Einstein's (1 909a,b) formula for fluctuations in b lack- body radiation in the 
Dreimdnnerarbeit of Born. Heisenberg. and Jordan ( 19261 ). see Duncan and Janssen 
(200 a pp. 643-6 45). For further discussion of Heisenberg's (1926c) note 



Rechenbergl feoid . pp. 541-543). 



sec 
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formulation of the interpretation is Jordan's (1927b, p. 813)0 

It is easy to understand why Heisenberg would have been excited about what 
he saw as Dirac's generalization of his own work. He felt strongly that the 
interpretation of the quantum formalism should naturally emerge from ma- 
trix mechanics without any appeal to wave mechanics. For this reason, he 
initially disliked Born's statistical interpretation of the wave function as well 
as Bohr's concept of complementarity, with its emphasis on wave-particle du- 
alityH Transformation theory reconciled him with both ideas. It showed that 
wave-particle duality was just one example of a much broader plurality of 
equivalent forms in which quantum mechanics can be expressed!"^"! And it 
showed that the Schrodinger energy eigenf unctions can be seen as elements of 
the transformation matrix diagonalizing the Hamiltonian. As Dirac put it: 

The eigenf unctions of Schrodinger 's wave equation are just the transfor- 
mation functions . . . that enable one to transform from the (q) scheme of 
matrix representatio n to a scheme in which the Hamiltonian is a diagonal 



matrix (IDirad . 119271 . p. 635; emphasis in the original) 



12 



For carefu l discu ssion of Dirac's (1927, sees. 6-7, pp. 637-641) own formulation, 



see 

10 



Darrigoll (|1992 . pp. 342-343). 



Heisenberg explicitly said so when interviewed by Kuhn for the AHQP project. 
Talking about Born's statistical interpretation of the wave function and his own 
proposal in his note on fluctuations, he said: "I definitely wanted to keep always 
on the quantum mechanical side and not make any concession to the Schrodinger 
side" (session 10, p. 6). Talking about complementarity, he likewise said: "I didn't 
want to go too much into the Schrodinger line of thought. I just wanted to stick 
to the matrix line" (session 11, p. 11). He made a similar statement in his autobi- 
ography: "Bohr was trying to allow for the simultaneous existence of both particle 
and wave concepts ... I disliked this approach. I wanted to start from the fact 
that quantum mechanics as we then knew it [i.e., matrix mechanics] already im- 
posed a unique physical interpretation . . . For that reason I was — certainly quite 
wrongly — rather unhappy about a brilliant piece of work Max Born had done ... I 
fully agreed with Born's thesis as such, bu t disliked the fact that it looked as if we 
still had some freedom of interpretation" (jHeisenbergl . Il97ll . p. 76). The following 
remark in the conclusion of one of the two papers in which Born first introduced his 
statis t ical int erpretation makes it easy to understand Heisenberg's initial animosity. 



Bornl (|1926bl . p. 826) wrote: "Schrodinger's form [of quantum mechanics] appears 



to account for the facts in by far the easiest way; moreover, it makes it possible to 
retain the usual conception of space and time in which events take place in perfectly 
familiar fashion." 

11 Heisenberg made this contrast between the dualism of complementarity and the 
pluralism of transformation theory in one of the passages from the AHQP interview 
referred to in the preceding note (session 11, p. 11). 

12 Thi s passag e is quoted in most discu ssions of Dirac's p aper. See, e.g., 
(1 19661 . p. 403). lKraghl (1 199(1 p. 40), and iRechenbeTe] |j20ld . p. 547). 



Jammer 
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Jordan! (I1927bl . p. 810, p. 822) clearly recognized this too. The probability in- 
terpretation could thus either be given in terms of Schrodinger wave functions 
or in terms of transformation matrices, 
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Heisenberg explicitly made the connection between wave functions and trans- 
formation matrices in the letter to Jordan from which we quoted above: 

S is the solution of a transformation to principal axes and also of a differen- 
tial equation a la Schrodinger, though by no means always in g-space. One 
can introduce matrices of a very general kind, e.g., S with indices S(q,E). 
The S(q, E) that solves Born's transformation to principal axes in Qu.M. II 
(Ch. 3, Eq. (13)) is Schrodiriger's S(q,E) = ipE^q) (Heisenberg to Jordan, 
November 24, 1926, AHQP)F 



14 



Eq. (13) in Ch. 3 o f "Qu.M. II," the famous Dreimann erarbeit, is H(pq) = 
SH^q^S^ 1 = W (IBorn. Heisenberg. and Jordan! . Il926l . p. 351). This equa- 
tion summarizes how one solves problems in matrix mechanics: one has to 
find the matrix S for a "transformation to principal axes," i.e., a canon- 
ical transformation from initial coordinates (p°q°) to new coordinates (pq) 
in which the Hamiltonian H becomes the diagonal matrix W . In Ch. 3 of 
the Dreimannerarbeit, i.e., prior to Schrodinger 's work, Born, who wrote the 
chapter, had already come close to making the connection Heisenberg makes 
here between transformation matrices S and solutions ipE{q) of the time- 
independent Schrodinger equation 



15 



Expanding on his comments in the letter to Jordan, Heisen berg began his next 
paper — the sequel to the paper on resonance ph enomena (|Heisenberd . Il926al ) 
that he had used for his note on fluctuations (IHeisenbergi . Il926bl ) — with a 
three-page synopsis of the remarkable new formalism that subsumed both wave 
and matrix mechanics (IHeisenbergi . Il927al . sec. 1, pp. 240-242). This paper was 
received by Zeitschrift fiir Physik on December 22, 1926. The new formalism 



13 The passage from Heisenberg's autobiography quoted in note 10 continues: "I 
was firmly convinced that Born's thesis itself was the necessary consequence of the 
fixed interpretation of special magnitudes in quantum mechanics. This conviction 
was strengthe ned further by tw o highly informative mathematical studies by Dirac 
and Jordan" (jHeisenbereJ . Il97ll . pp. 76-77). 

14 Heisenberg added: "The more general cases can be seen as a simple appli- 
cation of Lanczos's "field equations" (the same might be true of your <p(y) = 
f K(x,y) ijj(x) dxl l ). I h ave said all along that this Lanczos is not bad" (ibid.). 
Cornelius Lanczosl ( 19261 ) wrote a paper on the connection between matrices and 
integral kern els (such as K(x, y) i n the e quation above). This paper is cited by both 
Dirad (jl927l p. 624) a nd I Jordan! dl927bl. p. 832 ) For discussion of Lanzcos's (1926) 
contribution, see, e.g., Mehra and Rechenberg! (1987, pp. 665-668). 

15 For a reconstruction in m odern language of the steps Born was still missing, see 
Duncan and Janssen ( 20091 . p. 355). 
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was of such recent vintage at that point that iHeisenberg (11927al. p. 240, note) 



had to list t he thre e main sources he cited for it (jDirad . 119271 ; iJordanl . Il927b ; 
Heisenberg . Il926bl ) as "forthcoming" [im Erscheinen}. He made much of the 
third item, his own note on fluctuations. Jordan, he wrote, "has independently 
found results that are equival ent to those of the D irac paper and those of a pre- 
ceding paper by the author" (IHeisenbergi . Il927al . p. 240, note; our emphasis). 
He referred to the latter again when he introduced the probability interpreta- 
tion of arbitrary transformation matrices (ibid., p. 242), Dirac's far-reaching 
generalization of his own proposal in his note on fluctuations. Heisenberg cited 
Jordan's paper in addition to his own note but not Dirac's paper. Moreover, 
he did not mention Born's (1926a,b) statistical interpretation of the wave 
function anywhere in this brief exposition of the new general formulation of 
quantum mechanics. This undoubtedly reflects Heisenberg' s initial hostility 
toward Born's seminal contribution (see note 10) 



16 



In his later reminiscences, Heisenberg did give Born and Dirac their due (cf. 
note 10). Discussing the statistical interpretation of the quantum formalism 
in his contribution to the Pauli memorial volume, for instance, he mentioned 
Born, Dirac, and Pauli along with his own modest contribution (see the itali- 
cized sentence below). On this occasion, however, he omitted Jordan, perhaps 
because, for reasons we will discuss below, he much preferred Dirac's version 
of transformation theory over Jordan's. Heisenberg wrote: 



In the summer of 1926, Born developed his theory of collisions and, building 
on an earlier idea of Bohr, Kramers, and Slater, correctly interpreted the 
wave [fu nction] in multi-dimensional configuration space as a probability 
Pauli thereupon explained to me in a letter that Born's interpreta- 



wave 
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16 In h is paper on the uncerta inty p rinciple a few months l ater, Heisenberg 
dl927bl. p. 176, note) did ci t e iBornl dl926bll. but only after lEinsteinl (| 19251 ). 
Born. Heisenberg. and Jordan! (|1926l ) , and IJordanl (|1926al ) , three papers that have 
nothing to do with the statistical interpretation of wave functions or transformation 
matrices. In fairnes s to He isenberg, however, we should note that he now also cited 
an article by IBornl (]1927d ) on quantum mechanics and statistics in Die Naturwis- 
senschaften. 

17 Heisenberg repeatedly claimed that Born's statistical interpretation of the wave 
function owed a debt to the theory of Bohr. Kramers, and Slater ( 19241 ). according 
to which so-called 'virtual radiation' determines the probability of atomic transi- 
tions. Born himself took exception to this claim (AHQP interview with He isenberg, 



Jammer 



sessio n 4, p. 2). Yet, in his insightful discussion of Born's contribution _ 
( 19661 . p. 286) follows Heisenberg in identifying B KS as one of its r o ots. I n addition 
to the quantum collision papers (jBornl . Il926al lbl. Il927al ). Ijammerl (|l966l . pp. 283- 
290) disc usses two papers ela borating on the statistica l interpretatio n of the wave 
function ( Born . 1926c . 1927bl ). In the second of these, Born ( 1927bl . p. 356, note) 
emphasized that his theory "is not equivalent to that of Bohr, Kramers, Slater." 
Bornl ()1926bl . p. 804) did explicitly acknowledge one of the other roots identified by 
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tion is only a special case of a much more general interpretation prescription. 
He pointed out that one could, for instance, interpret \ip{p)\ 2 dp as the prob- 



ability that the particle has a momentum between p and p + <ip|__| This 
fit well with my own considerations about fluctuation phenomena. In the 
fall of 1926, Dirac developed his transformation theory, in which then in all 
generality the absolute squares of matrix ele ments of unitary tr ansforma- 
tion matrices were interpreted as probabilities (jHeisenbergl . Il960l p. 44; our 
emphasis). 

Born's work was undoubtedly more important for the development of the 
Dirac- Jordan statistical transformation theory than Heisenb erg's. Before ac- 
knowledging Heisenberg's note on fluctuations, iDirad ( 119271 . p. 621), in fact, 
had already acknowledged both the preliminary announcement and the full 
exposition of Born's (1926a,b) theory of quantum collisions, published in July 
and September of 1926, respectively. These papers contained the statistical 
interpretation of the wave function for which Born was awarded part of the 
1954 Nobel Prize in physics. 



Concretely, iBorn (Il926bl . p. 805) suggested that, given a large number of sys- 
tems in a superposition %l)(q) = J2 n c n of energy eigenfunctions ip n {Q), the 
fraction of systems with eigenfunctions ]b n (q) is given by the absolute square of 



the compl e x expa nsion coefficients c n |__| In the preliminary version of the pa- 



per, iBornl ( 11926al p. 865) introduced his probability interpretation examining 
the case of inelastic scattering of an electron by an atom. He wrote the wave 
function of the electron long after and far away from the point of interaction 
as a superposition of wave functions for free electrons flying off in different 
directions. As in the case of the expansion in terms of energy eigenstates men- 
tioned above, Born interpreted the absolute square of the coefficients in this 
expansion in terms of free electron wave functions as the probability that the 
electron flies off in a particular direction. He famously only added in a footnote 
that this probability is not given by these coefficients themselves but by their 



Jammer ( 19661 . p. 285): Einstein's proposal to interpret the electromagnetic field as 
a guiding field for light quanta. F or another discussion of Born's stat istical inter- 
pretation of the wave function, see lMehra and Rechenberd (|2000-200ll . pp. 36-55). 

18 See Pauli to Heisenberg, October 19, 1926 (jPauil ll979j, Doc. 143; the relevant 
passage of this 12-page letter can be found on pp. 347-348). Whe n listing his sources 
for the presentation of transformation theory mentioned above, iHeisenbera (I1927al . 
p. 240, note) re ferred to that same letter : "Several of these results [of Dirac ( 19271 ). 
Jordanl (|l927bl h and iHeisenber^ (|l926bh ] had already been communicated earlier 
and independently to me by Mr. W. Pauli." 

19 We will interpret 'fraction of systems with a particular energy eigenfunction' as 
'fraction of systems found to be in a particular energy eigenstate upon measurement 
of the energy,' even though Born did not explicitly say this. A careful distinction 
between pure and mixed states also had yet to be made (see von Neumann, 1927b, 
which we will discuss in Section 6). 
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absolute square ( iBornl . Il926al p. 865). 



In Neue Begriindung I, Jordan ( 1927bl . p. 811) cited Born's second longer pa- 
per on quantum collisions and a more recent on e in which he elaborated on his 
statis t ical in terpretation of the wave function (Born, 1926bl lcT). In the latter, 
Bornl (jl926d . p. 174) noted, although he did not use that term at this point, 
that his probability interpretation typically le ads to the occurrence of int erfer- 
ence te rms (see note 37 for a simple example) J 20 H 21 1 These two papers by [Born 



f ll926bl Jch are mentioned much more prominently in Neue Begriindung I than 
Heisenberg's (1926b) note on fluctuations, which is cited only as prov iding an 
examp le of a special case in which there are no interference terms (j Jordan . 



1927bl . p. 812, note). 



In a two-part overview of recent developments in quantum t heory t h at ap- 
peared in Die Naturwissenschaften in July and August 1927, IJordanl ( 11927i . 
Pt. 2, p. 645) accorded Heisenberg's note a more prominent role, recognizing 
it, along with one of his own papers, as an important step toward statisti- 
cal transformation theory. Jordan's (1927a) paper was submitted about three 
weeks after Heisenberg's (1926c) but was independent of it J 22 1 The two papers 
are remarkably similar. Both are concerned with the reconciliation of two de- 
scriptions of the energy exchange between two quantum systems, a continuous 
description in terms of a mechanism similar to beats between two waves and 
a description in terms of quantum jumps (cf. note 8). Both argued, against 



20 



Acc ording to IBornl ()1926d . p. 174, note), the same result was obtained by iDirac 



1926). Born was probably referring to the phase averaging occurring in this paper 
Dirad. Il926l. p p. 675-677). The connection strikes us as rather tenuous. However, 
Jammer! ( 19661 . p. 290), citing Born's acknowledgment of Dirac's paper, claims that 
the result was "incorporated into the construction of the transformation theory of 
quantum mechanics." Unfortunately, Jammer does not identify which passage in 
Dirac's paper he takes Born to be referring to. 

21 Had Heisenberg not resented B orn's use of w ave mechanics so strongly, he could 
not have helped but notice that IBornl ()1926d ) had already proposed to interpret 
elements of transformation matrices as probabilities before he himself did in his note 
on fluctuations (Born's paper appeared December 6, 1926, i.e., after submission but 
before publication of Heisenberg's note). Born wrote: 11 If because of a perturbation, 
persisting for a finite time T, the system is taken from one state to another, then 
the distribution constants, the squares of which give the states' probabilities [i.e., the 
coefficients c n in ip(q) = J2 n c ™ VVi(<z)]j undergo an orthogonal transformation^ C n = 

of the coeffi cients are the transition 



The 



squares 



1926c], p. 176; emphasis in the 



probabilities^ 4> nm = \b nm \ 2 , *52 m $nm = 1 (IBornl . 
original; quoted by Mehra and Rechenberg, 2000-2001, p. 52). 
22 Jordan's paper was received by Zeitschrift fur Physik on November 25, 1926, 
and published January 2, 1927; Heisenberg's, as we already noted, was received 
November 6 and published December 20, 1926. In the letter from which we already 
quoted several passages above, Heisenberg praised Jordan's paper: "Your resonance 
example is very pretty" (Heisenberg to Jordan, November 24, 1926, AHQP). 
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SchrodingerJ 23 1 that despite ap pearances to the contrary quantum jumps are 



api 

unavoidable. As I Jordan! (Il927il . Pt. 2, pp. 645-646) concluded in his Naturwis- 
senschaften article, the probabilistic nature of the laws of quantum mechanics 
is key to reconciling the continuous and discontinuous descriptions. In the 
letter Heisenberg mentioned above (see note 18), Pauli had drawn the same 
conclusion from his analysis of an ingenious example of his own device that he 
told H e isenb erg was "a pure culture of your favorite resonance phenomenon'' 
f lPaulil . " 



1979 



p. 344). Instead of two quantum systems exchanging energy, 
Pauli considered one quantum system, a particle constrained to move on a 
closed ring, which periodically encounters a small obstacle. He considered the 
case in which the particle is in a state in which it should alternate between ro- 
tating clockwise and rotating counterclockwise. That was only possible, Pauli 
explained, if we accept the conclusion of Born's analysis of quantum collisions 
that there is a definite probability that the system reverses course upon hit- 
ting the obstacle, which classically would not be large enough to change the 
directio n of the system's rotation. Unlike Pauli in this letter or iHeisenberg 
( I1926bl ) in his note on fluctuations, iJordanl (Il927al ) did not elaborate on ex- 
actly how the statistical element in his example of a resonance phenomenon 
should formally be introduced, via wave functions a la Born or via transfor- 
mation matrices a la Heisenberg. In the Nat u rwisse nschaften article, he did 
not resolve this issue either. Instead, IJordanl ( 1l927il Pt. 2, p. 646) wrote in 
the paragraph immediately following his discussion of resonance phenomena 
that the statistical nature of the quantum laws "manifests itself in many ways 
even more impressively and intuitively" in Born's (1926b) analysis of quan- 
tum collisions. This strongly suggests that Jordan's statistical interpretation 
of quantum mechanics in Neue Begriindung I owed more to Born's statistical 
interpretation of wave functions than to Heisenberg's statistical interpretation 
of transformation matrices. 



Jordan was particularly taken with Pauli's further development of Born's ideas 
even though he probably did not see the letter with Pauli' s probability inter- 
pretation of |^(p)| 2 rfp mentioned above (see note 18). As iRechenbergj (120101 . 
p. 54 4) points ou t, citing a letter from Heisenberg to Pauli of October 28, 



1926 f lPaulil . Il979l . p. 349), Pauli's letter made the rounds in Copenhagen and 



23 Like Heisenberg, Jordan profoundly disliked Schrodinger's interpretation of the 
wave function while appreciating wave mechanics as providing a re servoir of math- 
ematical tools in the service of matrix mechanics. Jordan ( 1927fl ) bluntly stated 
this opinion in his review of a volume collecting Schrodinger's papers on wave me- 
chanics (Schrodinger, 1927; for a long quotation from Jordan's review, see Mehra 
and Rechenberg, 2000-2001, pp. 59-60). Although Jordan did not mention Born's 
interpretation of the wave function in his review (he only criticized Schrodinger's), 
he did not share Hei senberg ' s initia l resistance to Born's work. In his habilitation 
lecture, for instance, Jordan ( 1927dl . p. 107) praised the "very clear and impressive 
way" in which Born had introduced this interpretation. 
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Dirac, for instance, certainly read it, but Jorda n was in Gottingen at the time 
and would not have had access to it. Instead, IJordanl (jl927bl . p. 811, note) 
referr e d to a forthcoming paper by Pauli on gas degeneracy. In that paper, 
Paulil (Il927al . p. 83, note) only suggested that \i/)(q) \ 2 dq (with q = (q 1 , . . . , qj), 
where / is the number of degrees of freedom of the system) be interpreted as 
the probability of finding the system at a position somewhere in the region 
(q, q + dq). As Jordan emphasized in the Naturwissenschaften article discussed 
above , even t his suggestion is subtly different from what Born had proposed!" 24 ! 
Bornl (|1926al Jbl) considered probabilities such as the probability that a system 
has a particular energy or the probability that an electron scatters in a partic- 
ular direction (or, equivalently, has a certain momentum). Those probabilities 
are given by the absolute squares of coefficients in some expansion of the wave 
function (in configuration space), either as a superposition of energy eigen- 
states or, in the case of collisions, as a superposition (asymptotically) of wave 
functions for free particles with specified outgoing momenta. Pauli seems to 
have been the first to ask about the probability that a system has a particular 
position. That probability is given by the absolute square of the wave function 
itself rather than by the absolute square of the coefficients in some expansion 
of it, 



25 



The generalization in Pauli's letter to Heisenberg from wave functions 
in configuration space to wave functions in momentum space is not mentioned 
in Neue Begriindung I. It is certainly possible, however, that Pauli had shared 
this generalization with Jordan, as the two of them regularly saw each other 



during this period and even went on vacation together J I As a result, it is 
impossible to reconstruct what exactly Jordan got from Pauli, whose name 
occurs no less than seven times in the first three pages of Neue Begriindung I. 



Jordan wanted to use what he had learned from Pauli to provide a new uni- 
fied foundation for the laws of quantum mechanics by showing that t hey can 
be der ived as "consequences of a few simple statistical assumptions" (I Jordan . 
1927bl . pp. 810-811). The central quantities in Jordan's formalism are what 
he called "probability amplitudes." This echoes Born's (1926b, p. 804) term 
"probability waves" for Schrodinger wave fu nctions but is an extremely broad 
generalization of Born's concept. Moreover, iJordan fll927b[ p. 811) credited 
Pauli rather than Born with suggesting the term. Jordan defined a complex 
probability amplitude, ip(a,b), for two arbitrary quantum-mechanical quanti- 



24 Discussing "the statistical int erpretations of the Schrodinger function given by 
Born and Pauli," IJordanl (|1927il . Pt. 2, p. 647) added parenthetically: "(it should 



be emphasized that Born's hypothesis and Pauli's hypothesis, though related, are 
at first independent of one another)." Unfortunately, Jordan did not explain what 
he meant by the qualification "at first [zunachst]" in this clause. 
25 Although Born did not give the probability interpretation of \i^{q)\ 2 dq in his 
papers on collision theor y, he d i d state a completeness relation highly suggestive of 



it: / \il>{q)\ 2 dq = £ |c n | 2 (|Bornl . Il926bl . p. 805). 

36 AHQP in terview with J ordan, session 3, p. 15, cite d by iMehra and Rechenberg 



(12000-20011 . p. 66) and bv lDuncan and Janssenl (|2009l . p. 359). 
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ties a and b with fully continuous spectra[f_| At this point, he clearly labored 



under the illusion that it would be relatively straightforward to generalize his 
formalism to cover q uantities with w holly or partly discrete spectra as well. In 
Neue Begriindung II, iJordanl (Il927gi ) would discover that such a generalization 
is highly problematic. Unaware of these complications and following Pauli's 
lead, Jordan interpreted \(p(a,b)\ 2 da as the conditional probability Pr(a|6) for 
finding a value between a and a + da for a given that the system under con- 
sideration has been found to have the value b for the quantity b. 



28 



Eigenfunctions iPe{<i) with eigenvalues E for some Hamiltonian of a one- 
dimensional system in configuration space are examples of Jordan's probability 
amplitudes (f(a, b). The quantities a and b in this case are the position q and 
the Hamiltonian H, respectively. Hence \ipE{(l)\ 2 dq = \<p(q, E)\ 2 dq gives the 
conditional probability Yi(q\E) that q has a value between q and q + dq given 
that H has the value E. This is the special case of Jordan's interpretation 
given in Pauli's paper on gas degeneracy (in / dimensions). 

As indicated above, Jordan took an axiomatic approach in his Neue Begriin- 
dung paper s. Here we cl early see the influence of the mathematical tradition in 
Gottingen (jLackil . 120001 ) . In fact, Jordan had been Richard Courant's assistant 
before becoming Born's. In his Neue Begriindung papers, Jordan began with a 
series of postulates about his probability amplitudes and the rules they ought 
to obey (the formulation and even the number of these postulates varied) and 
then developed a formalism realizing these postulates. 

A clear description of the task at hand can be found in a paper by David 
Hilbert, Lothar Nordheim, and the other main protagonist of our story, John 
von Neumann, who had come to Gott ingen in the fall of 1926 on a fellow ship 
of the International Education Board ( iMehra and Rechenbergi . 12000-200 1[ pp. 
401-402). Born in 1903, von Neumann was a year younger than Dirac and Jor- 
dan, who, in turn, were a year younger than Heisenberg and two years younger 
than Pauli. The paper by Hilbert, von Neumann, and Nordheim grew out of 
Hilbert's course on quantum mechanics in 1926/1927 for which Nordheim pre- 



pared most of the notes|_f_| The course concluded with an exposition of Neue 



27 This is the first of several instances where we will enhance Jordan's own notation. 
In Neue Begriindung I, Jordan used different letters for quantities and their values. 
We will almost always use the same letter for a quantity and its values and use 
a hat to distinguish the former from the latter. The main exceptio n will be the 
Hamiltonian H and the energy eigenvalues E. As we noted above, iDirad (|1927l ) 
used primes to distinguish the value of a quantity from the quantity itself. 

28 The notation Pr(a|6) is ours and is not used in the sources we discuss. At this 
point, we suppress another wrinkle that will loom large in our paper, the so-called 
'supplementary amplitude' (Erg dnzung Samplitude; see, in particular, Section 2.4). 

29 See p. 13 of the transcript of the int erview with Nordheim for th e Archive of the 
History of Quantum Physics (AHQP) (jDuncan and Janssenl . I2009I . p. 361). 
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Begriindung I (ISauer and Majerl . 120091 . pp. 698-706). The notes for this part 
of the course formed the basis for a paper, which was submitted in April 1927 



but not published, for whatever reason, until the beginning of 1928 J__J In the 



introduction, the authors described the strategy for formulating the theory: 

One imposes certain physical requirements on these probabilities, which are 
suggested by earlier experience and developments, and the satisfaction of 
which calls for certain relations between the probabilities. Then, secondly, 
one searches for a simple analytic al apparatus in which quantities occur 
that satisfy these relations exactly (iHilbert. von Neumann, and Nordheiml . 



19281 . p. 2-3; cf. Lacki, 2000, p. 296). 



After everything that has been said so far, it will not come as a surprise that 
the quantities satisfying the relations that Jordan postulated for his prob- 
ability amplitudes are essentially^} the transformation matrices central to 



Dirac's (1927) presentation of the statistical transformation theory. An exam- 



ple, based on lecture notes by Dirac 
this works. 



32 



will give a rough illustration of how 



One of the features that Jordan saw as characteristic of quantum mechanics 
and that he therefore included among his postulates was that in quantum me- 
chanics the simple addition and multiplication rules of ordinary probability 
theory for mutually exclusive and independent outcomes, respectivel y, apply 
to prob ability amplitudes rather than to the probabilities themselves. [Jordan 
(jl927bl . p. 812) used the phrase "interference of probabilities" for this feature. 
He once agai n credited Pauli with the name for this phenomenon, even though 
Born ( 1926bl . p. 804) had already talked about the "interference of . . . "prob 
ability waves" " in his paper on quantum collisions. As we saw above, iBori 
(Il926a . p. 174) had discussed the feature itself in a s ubsequent pape r, albeit 
only in the special case of Schrodinger wave functions. iJordanl (|1927bl ) was the 
first, at least in print, who explicitly recognized this feature in full general- 
ity. It is implicit in Dirac's (1927) version of transformation theory, but Dirac 
may well have shared the skepticism about the interference of probabilities 



30 



Jammed (119661 . pp. 310-312) talks about the "Hilbert-Neumann-Nordheim trans- 
formation theory" as if it were a new version of the theory, going beyond "its 
predecessors, the theories of Dirac and Jordan" (ibid., p. 312). However, what Jam- 
mer sees as the new element, the identification of probability amplitudes with the 
kernels of certain integral operators (ibid., p. 311), is part and parcel of Jordan's 
papers (see note 14 as well as Sections 2 and 4). 

31 The qualification is related to the complication mentioned in note 28 and will be 
explained at the end of this subsection. 

32 AHQP, "No tes for Dira c 's firs t lecture c ourse on q uantu m mechanics, 115 pp. 



[Oct 1927?]" (jKuhn et all Il967l . p. 32). As iDarrieoll (|199l p. 344) notes, "Dirac 



did try [on pp. 6-9 of these lecture notes] to lay out his competitor's theory, but he 
quickly returned to his own, which he found simpler and more elegant." 
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that Heisenberg expressed in a letter to Jordan (we will quote and discuss the 
relevant passage below). 



As we will see in Section 2.1, Jordan's postulate about the addition and mul- 
tiplication of probability amplitudes basically boils down to the requirement 
that the amplitudes y?(a,c), ij)(a,b) and x(°; c ) for the quantities a, b, and c 
with purely continuous spectra satisfy the relation ip(a, c) = J db ip(a, b) x(b, c). 
It is easy to see intuitively, though much harder to prove rigorously (see Section 
1.2), that this relation is indeed satisfied if these three amplitudes a re equated 



with the transformation matrices — in the notation of iDirad ( 119271 ) explained 
above — (a/c), (a/6), and (b/c), respectively. These matrices relate wave func- 
tions in a-, b-, and c-space to one another. From ip(a) = J db (a/b) ip(b) and 
ip(b) = J dc{b/c) ip(c), it follows that 

^(a) = J J dbdc{a/b) (6/c)^(c). 

Comparing this expression to 4>(a) = J dc(a/c)ip(c), r we see that (a/c) 
/ db(a/b) (b/c), in accordance with Jordan's postulates 



33 



This brings us to an important difference between Jordan's and Dirac's ver- 
sions of their statistical transformation theory. For Dirac, the transformation 
element was primary, for Jordan the statistical element was. Most of Dirac's 
(1927) paper is devoted to the development of the formalism that allowed him 
to represent the laws of quantum mechanics in different yet equivalent ways 
(sees. 2-5, pp. 624-637). The probability interpretation of the transformation 
matrices is then grafted onto this formalism in the last two sections (ibid., 
sees. 6-7, 637-641). Jordan's (1927b) paper begins with the axioms about 
probability (Pt. I, sees. 1-2, pp. 809-816). It is then shown that those can be 
implemented by equating probability amplitudes with transformation matri- 
ces, or, to be more precise, with integral kernels of canonical transformations 
(Pt. 2, sees. 1-6, pp. 816-835). 

Heisenberg strongly preferred Dirac's version of statistical transformation the- 
ory over Jordan's. For one thing, he disliked Jordan's axiomatic approach. As 
he told Kuhn in his interview for the AHQP project: 



Jordan used this transformation theory for deriving what he called the ax- 
iomatics of quantum theory . . . This I disliked intensely . . . Dirac kept 
within the spirit of quantum theory while Jordan, together with Born, went 



33 Dirac lecture notes, p. 7 (cf. note 32). Dirac used a, f3, and 7 instead of a, b, and 
c, and considered a purely discrete rather than a purely continuous case, thus using 
sums instead of integrals. Dirac actually used the notation (.|.) in his handwritten 
letters and manuscripts during this period, but apparently did not mind that this 
was rendered as (./.) in print. 
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into the spirit of the mathematicians (AHQP interview with Heisenberg, 
session 11, pp. 7-8; quoted by Duncan and Janssen, 2009, pp. 360-361). 



Presumably, the axiomatic approach in and of itself would not have presented 
too much of an obstacle for Heisenberg. As Paul Ehrenfest told Jordan, who 
still remembered it with amusement when interviewed decades later: "Since 
you wrote the paper axiomatically, that only means that one has to read it 



back to front. "I I In that case, one would encounter probability amplitudes in 



the guise of transformation matrices first, as in Dirac's version of the theory. 

Heisenberg had more serious reservations about Jordan's version of theory, 
which initially made it difficult for him even to understand Neue Begriindung 
I. He expressed his frustration in a letter to Pauli a few weeks after the paper 
was published. After praising Jordan's (1927d) habilitation lecture which had 
just appeared in Die Naturwissenschaften, Heisenberg wrote: 

I could not understand Jordan's big paper in [Zeitschrift fur Physik]. The 
"postulates" are so intangible and undefined, I cannot make heads or tails 
of them (Heisenberg to Pauli, February 5, 1927; Pauli, 1979, p. 374) I 3 ^] 

About a month later, Heisenberg wrote to Jordan himself, telling him that he 
was working 

on a fat paper [Heisenberg, 1927b, on the uncertainty princple] that one 
might characterize as physical commentary on your paper and Dirac's. You 
should not hold it against me that I consider this necessary. The essence 
from a mathematical point of view is roughly that it is possible with your 
mathematics to give an exact formulation of the case in which p and q are 
both given with a certain accuracy (Heisenberg to Jordan, March 7, 1927, 
AHQP; emphasis in the original) 



36 



Heisenberg now had a clearer picture of Jordan's approach and of how it dif- 
fered from the approach of Dirac, who had meanwhile left Copenhagen and 



34 AHQP in terview wit h Jordan, session 3, p. 17; qu oted bv lMehra and Rechenberg 
(j2000-200ll . p. 69) and buncan and Jansser] (l2009l . p. 360). 



35 Even Dirac's (1927) paper, which for modern readers is much more accessible than 
Jordan's Neue Begriindung I, was difficult to understand for many of his contempo- 
raries. In a letter of June 16, 1927, Ehrenfest told Dirac that he and his colleagues 
in Leyden had had a hard time wi th several of his papers, including the one on 
transformation theory (jKraghl . Il990l . p. 46). 

36 Heisenberg's (1927b) paper was received by Zeitschrift fur Physik on March 23, 
1927. Shortly before he submitted the paper, he wrote to Jordan for help with a sign 
error in a derivation in Neue Begriindung I that he needed for his paper (see note 71). 
For discuss ion of He i senbe rg's reliance on transformation theory fo r his uncerta inty 
paper, see Jammer (jl966l . pp. 326-328), Mehra and Rechenberg ( 2000-2001 . pp. 



148-151, and pp. 159-161), and Beller (1985, 1999, pp. 91-95). 
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had joined Born and Jordan in Gottingen. After registering some disagree- 
ments with Dirac, Heisenberg turned to his disagreements with Jordan: 



With you I don't quite agree in that, in my opinion, the relation / <p(x, y) 
ijj(y, z) dz has nothing to do with the laws of probability. In all cases in 
which one can talk about probabilities the usual addition and multiplication 
of probabilities is valid, without "interference." With Dirac I believe that 
it is more accurate to say: all statistics is brought in only through our 
experiments (Heisenberg to Jordan, March 7, 1927, AHQP; emphasis in the 
original). 

The reservation expressed in the first two sentences is largely a matter of se- 
mantics. Heisenberg did not dispute that the relation given by Jordan, which 
we wrote as <p(a,c) = J dbip(a,b) x(b,c) above, holds in quantum mechanics 
(as we will see in Section 1.2, it corresponds to the familiar completeness and 
orthogonality relations). Heisenberg also did not dispute that this relation 



describes interference phenomenaJ 37 1 What Heisenberg objected to was Jor- 
dan's way of looking upon this relation as a consequence of his addition and 
multiplication rule s for probability amplitudes. This did not materially affect 



Jordan's theory as Jordan! ( 1927b . p. 813) only ever used those rules to derive 



this particular relation 



38 



The reference to Dirac for Heisenberg's second reservation is to the concluding 
sentence of Dirac's paper on transformation theory: 

The notion of probabilities does not enter into the ultimate description of 
mechanical processes: only when one is given some information that i nvolves 



a pro bability . . . can one deduce results that involve probabilities ((Dirac 



19271 . p. 641) 



39 



37 A simple example, the familiar two-slit experiment, immediately makes that clear. 
Let the quantity c (value c) be the position where photons are emitted; let b (values 
&i and b 2 ) be the positions of the two slits; and let a (value a) be the position where 
photons are detected. On the assumption (vindicated by modern theory) that the 
relation also applies to quantities with discrete values (such as b in this example), 
the integral on its right-hand side reduces to a sum of two terms in this case: 
f(a,c) = ip(a, b\) x(bi, c) + ip(a, b 2 ) x(&2 5 c). Multiplying the left- and the right- 
hand side by their complex conjugates, we find that the probability Pr(a|c) is equal 
to the sum of the probabilities Pr(a&&i|c) and Pr(a& b 2 \c) plus the interference 
terms ip(a, bi)ip*(a, b 2 ) x(h,c) x*(^2, c) and ip(a, b 2 ) tp*(a, h) x(h,c) x*(h,c). 

38 Heis enberg elaborated on his criticism of Jordan on this score in his uncertainty 
paper ( Heisenberg! . 1927bl . pp. 183-184, p. 196). Von Neumann (1927a, p. 46; cf. note 



128) initially followed Jordan (see also Hilbert, von Neumann, and Nordheim, 1928, 
p. 5; c f. note 93), but he cha nged his mind after reading Heisenberg's uncertainty 
paper ( von Neumann . 1927bl , p. 246 ; cf. no te 133). 



39 , 



Quoted and discussed by iKraghl (jl99dl . p. 42). Dirac's position, in turn, was 



undoubtedly influenced by the exchange between Pauli and Heisenberg that led to 
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Instead of elaborating on this second reservation, Heisenberg told Jordan that 
he had just written a 14-page letter about these matters to Pauli and suggested 
that Jordan have Pauli send him this letter for further details. In this letter, 
the blueprint for his uncertainty paper, Heisenberg told Pauli: 

One can, like Jordan, say that the laws of nature are statistical. But one 
can also, and that to me seems considerably more profound, say with Dirac 
that all statistics are brought in only through our experiments. That we 
do not know at which position the electron will be the moment of our 
experiment, is, in a manner of speaking, only because we do not know the 
phases, if we do know the energy . . . and in this respect the classical theory 
would be no different. That we cannot come to know the phases without 
. . . destroying the atom is characteristic of quantum mechanics (Heisenberg 
to Pauli, February 23, 1927; Pauli, 1979, Doc. 154, p. 377; emphasis in the 
original; see also Heisenberg, 1927b, p. 177). 

So Heisenberg — and with him Dirac — held on to the idea that nature itself is 
deterministic and that all the indeterminism that quantum mechanics tells us 
we will encounter is the result of unavoidable disturbances of nature in our 
experiments. Jordan, by contrast, wanted at least to keep open the possibility 
that nature is i ntrinsic a lly ind eterministic (as did Born [1926a, p. 866; 1926b, 



pp. 826-827]). iJordanl ( 11927dl ) made this clear in his habilitation lecture 



10 



the latter's uncertainty paper. Recall, for insta nce, that Dir ac read the long letter 
from Pauli to Heisenberg of October 19, 1926 (jPaulil . Il979l . quoted and discussed 



by Mehra and Rechenberg, 2000-2001, pp. 145-147). 

40 This lecture, published in Die Naturwissenchaften and, in a translation prepared 
by Oppenheimer, in Nature, shows that Jordan's position was more nuanced than 
Heisenberg made it out to be. First, Jordan clearly rec ognized that th e laws govern- 
ing his probability amplitudes are fully deterministic ( Jordan! . 1927dl . pp. 107-108 



in the German original; pp. 567-568 in the English translation). Second, whether 
nature is indeterministic remained an open question for him. As an example he 
considered the time of a single quantum jump, which, he wrote, "under certain cir- 
cumstances is a measurable quantity. What predictions can our theory make on this 
point? The most obvious answer is that the theory only gives averages, and can tell 
us, on the average, how many quantum jumps will occur in any interval of time. 
Thus, we must conclude, the theory gives the probability that a jump will occur at 
a given moment; and thus, so we might be led to conclude, the exact moment is 
indeterminate, and all we have is a probability for the jump. But this last conclusion 
does not necessarily follow from the preceding one: it is an additional hypothesis" 
(ibid., p. 109 [German]; p. 569 [English]). In general, he reiterated in the conclusion 
of his habilitation lecture, "[t]he circumstance that the quantum laws are laws of 
averages, and can only be applied statistically to specific elementary processes, is 
not a conclusive proof that the elementary laws themselves can only be put in terms 
of probability . . . Probably we shall find that an incomplete determinism, a certain 
element of pure chance, is intrinsic in these elementary physical laws. But, as I have 
said, a trustworthy decision will only be possible after a further analysis of quantum 
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In his Neue Begriindung papers, Jordan did not discuss the nature of the 
probabilities he introdu ced. He did not even pr operly define these probabilities. 
This was done only by Ivon Neumann! (jl927bl ) with the help of the notion of 
ensembles of systems from which one randomly selects members, a notion 
devel oped by R i chard von Mises and published in book form the following 
year (Ivon Misesl . Il928l . see note 134). 



As far as the issue of determinism versus indeterminism is concerned, Dirac 
thus stayed closer to classical theory than Jordan. In other respects, however, 
Jordan stayed closer. Most importantly, Jordan's use of canonical transfor- 
mations is much closer to their use in classical mechanics than Dirac's, 41 In 



the letter to Jordan from which we quoted right at the beginning of our pa- 
per, Dirac clearly identified part of the difference in their use of of canonical 
transformations: 



In your work I believe you considered transformations from one set of dy- 
namical variables to another, instead of a transformation from one scheme 
of matrices representing the dynamical variables to another scheme repre- 
senting the same dynamical variables, which is the point of view adopted 
throughout my paper. The mathematics appears to be the same in the two 
cases, however (Dirac to Jordan, December 24, 1916, AHQP) F^l 

Traditionally, canonical transformations had been used the way Jordan used 
them, as transformations to new variables, and not the way Dirac used them, 
as transformations to new representations of the same variables. Canonical 
transformations had been central to the development of matrix mechanics 
(Born and Jordan, 1925; Born, Heis enberg, and Jor dan, 1926; see Section 2.2 
below). Prior to Neue Begriindung I, Jordan ( 1926bl lcl) had actually published 
two important pap ers on the implementation of canonical t ransformations in 
matrix mechanics (lLackil . l2004j ; iDuncan and Janssenl . l2009h . Jordan's use of 
canonical transformations in his Neue Begriindung papers was twofold. First, 
as already mentioned above, Jordan tried to show that integral kernels in 
canonical transformations have all the properties that probability amplitudes 
must satisfy according to his postulates. Second, he tried to use canonical 
transformations to derive differential equations for probability amplitudes for 
arbitrary quantities (such as the time-independent Schrodinger equation for 
ip(q, E) = tpEiq)) from the trivial differential equations satisfied by the proba- 



m echanics" (ibid., p. 110 [German]; p. 569 [English]). 



li 



Darrigoll (jl992l . p. 343) also recognizes this but, pointing to Jordan's unusual 



definition of canonically conjugate quantities (see below), concludes that "Jordan 
departed much more from the classical model than did Dirac." This may be true, 
as we will see, for where Jordan ended up with Neue Begriindung II, but he started 
out following the classical model much more closely than Dirac. 



This is t he one paragraph of this letter not quoted by iMehra and Rechenberg 
(l2000-200ll . cf. note 2). 
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bility amplitude for p(p, q) for some generalized coordinate q and its conjugate 
momentum p that he started from. The 'mind your p's and g's' part of the 
title of our paper refers to the crucial role of canonical transformations and 
conjugate variables in Jordan's formalism. 



Both ways in which Jordan relied on canonical transformations in his Neue 
Begriindung papers turned out to be problematic and resulted in serious math- 
ematical problems for his version of statis tical transfor mation theory. These 
problems do not affect Dirac's version since iDirad (119271 ) only relied on canon- 
ical transformations in very loose sense. However, as we will see once we have 
introduced some modern tools to analyze the Dirac- Jordan theory in Section 
1.2, the two versions of the theory do share a number of other mathematical 
problems. The 'never mind your p's and g's' part of our title thus does not refer 
to Dirac's version of statistical transforma tion theory, bu t to the fundamen- 
tally different Hilbert space formalism that von Neumann! ( 1927a ) introduced 
as an alternative to the Dirac- Jordan theory (Section 1.3). 



To conclude this subsection, we briefly indicate how Jordan ran into prob- 
lems with his twofold use of canonical transformations in Neue Begriindung 
and explain why those problems do not plague Dirac's version of the theory. 
We return to these problems in Section 1.2, relying more heavily on modern 
concepts and notation, and analyze them in greater deta il in Se ctions 2 and 
4, again helping ourselves to modern tools. As iHeisenbergl (119601 p. 44) wrote 
in the passage from his contribution to the Pauli memorial volume quoted 
above, Dirac only considered unitary transformations in his transformation 
theory. Unitary transformations of Hermitian operators preserve their Her- 
miticity. Un fortunately, there are many canonical transformations that are 



not unitary ( IDuncan and Janssenl . 120091 p. 356). In Neue Begriindung I, Jor- 



dan therefore tried to develop a formalism that would allow quantities that, 
from a modern point of view, correspond to non-Hermitian operators. As long 
as the quantities a and b correspond to Hermitian operators, the probability 
amplitude f(a, b) is the integral kernel of a unitary transformation and the 
conditional probability Pr(a|6) is equal to \ip(a,b)\ 2 da. In such cases, the inte- 
gral kernel ip(a, b) is simply identical to Dirac's unitary transformation matrix 
(a/b). As soon as a and/or b are non-Hermitian, however, ip(a, b) is the integral 
kernel of a non-unitary transformation and the expression for Pr(a|6) becomes 
more complicated, involving both the probability amplitude itself and what 
Jordan cal led a "supplementary amplitude" (Erqanzunqs amplitude). Following 
the lead of lHilbert. von Neumann, and Nordheiml ( 119281 ). Jordan dropped the 
Erg dnzung Samplitude in Neue Begriindung II. The price he paid for this was 
what, from the point of view of classical mechanics, amounted to the rather 
arbitrary restriction of the canonical transformations allowed in his theory to 
unitary ones. 
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The other main problem that Jordan ran into with his use of canonical trans- 
formations is directly related to his axiomatic approach. Unlike Dirac, Jordan 
wanted to derive the entire theory from his statistical postulates. This meant, 
for instance, that Jordan did not include the canonical commutation rela- 
tion, [p,q] = pq — qp = hjlixi (where h is Planck's constant), among his 
postulates. Nor did he assume the usual association of the momentum p con- 
jugate to q with the differential operator {h/i)d/dq (with h = h/2n) acting 
on wave functions in g-space. Dirac as sumed both the se elements. Instead of 
the commutation relation for p and q, Jordan! ( 1927b . p. 814) assumed that 
the probability amplitude p(p, q) for these two quantities had a particularly 
simple form — in our notation: p{p, q) = e~ ipq / h . This then is how Planck's 
constant enters Jordan's formalism. Normally, the canonical commutation re- 
lation \p, q] = h/i defines what it means for p and q to be canonically conjugate 
variables. Jordan, however, defined p and q to be canonically conjugate if and 



only if p{p, q) = e~ ipq ' n ell This probability amplitude tells us that "[fjor a 
given value of q all possible values of p are equally probable" (Jordan, 1927bl . 
ibid, p. 814; emphasis in the original; hats added)cU The amplitude p{p,q) 
trivially satisfies the equations {p+{h/i) d/dq) p = and {{h/i) d/dp+q) p = 
(ibid.). Subjecting these basic equations to canonical transformations, Jordan 
argued for the usual association of quantum-mechanical quantities with dif- 
ferential operators acting on wave functions (e.g., p with {h/i)d/dq). Once 
those associations have been made, his assumption about the form of p{p, q) 
entails the usual commutation relation for p and q. Performing more canonical 
transformation on the basic equations for p{p,q), Jordan also tried to derive 
differential equations for probability amplitudes involving quantities related to 
the initial p and q via canonical transformations. In this manner, for instance, 
he tried to recover the time-independent Schrodinger equation. 



43 When Kuhn asked Jordan in his interview for the AHQP project why he had 
chosen this definition, Jordan said: "I wanted to put the probability considerations 
first and thought that the multiplication, the operators, would follow from these. 
If one posits that p and q are two quantities for which there is this particular 
probability [amplitude] then one can draw conclusions from that and if one defines 
multiplication afterwards, then one gets the commutation rules. The goal was to 
not make the commutation rules the starting point but to obtain them as the result 
of something else. The basic rules of the formulation should be conceived of as 
statements about probability amplitudes" (session 3, pp. 16-17). 

44 Jordan's habilitation lecture published about two months before Heisenberg's 
(1927b) paper (see note 36), contains a statement that is even more suggestive 
of the uncertainty principle: "With different experimental setups one can observe 
different coordinates. However, with a particular setup one can, at best, observe 
particular coordinates of an atom exactly, while it will then be impo ssible in that 
setup to observe the corresponding momenta exactly" ( Jordan . 1927dl . p. 108, note 
1, in the German version; this note was omitted in the English translation). 
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It was not until Neue Begriindung II, written in Copenhagen on an Interna- 
tional Education Board fellowship and received by Zeitschrift fur Physik on 
June 3, 1927, that Jordan realized that this strategy for deriving Schrddinger- 
type differential equations for his probability amplitudes was severely limited. 
He came to this realization as he was trying to extend his formalism, which 
in Neue Begriindung I was restricted to quantities with continuous spectra, 
to cover quantities with partly or wholly discrete spectra as well. The key 
problem (as we will show using modern tools at the end of Section 2.3), is 
that quantities related by a canonical transformation have the same spec- 
trum. It follows that Jordan's procedure to get from the differential equations 
for probability amplitudes for one pair of quantities to those for another pair 
fails as soon as one pair has purely continuous spectra (such as p and q) and 
the other pair has partly or wholly discrete spectra (such as, typically, the 
Hamiltonian) . In response to this problem, Jordan changed the way he used 
canonical transformations to the way Dirac had been using them all along, 
where the transformations are no lon ger to new quan tities but rather to new 
representations of the same quantities (|Jordanl . ll927gi . pp. 16-17; we will quote 
the relevant passage in Section 4). 



The treatment of quantities with partly or wholly discrete spectra in Neue 
Begriindung II necessitated further departures from the classical formalism of 
canonical transformations and conjugate variables. For quantities with fully 
continuous spectra, Jordan could show that his definition of conjugate vari- 
ables in terms of a probability amplitude reduces to the standard definition 
in terms of a commutation relation. This is not true for quantities with partly 
or wholly discrete spectra. In Neue Begriindung II, Jordan gave a simple 
proof that such quantities can never satisfy the standard commutation re- 
lation [p,q] = h/i (see Section 4, Eqs. (103)-(108)). Jordan presented it as a 
point in favor of his formalism that his alternative definition of canonically 
conjugate variables works for quantities with partly or wholly discrete spectra 
as well. Jordan's definition, however, led to such counter- intuitive proposi- 
tions as different components of spin qualifying as canonically conjugate to 
one another 45 



In fact, Jordan's initial answer to Kuhn's question why he had adopted a new 
definition of canonically conjugate variables (see note 43) was that it would be 
applicable to spin as well. Only when Kuhn reminded him that this application 
must have come later, did Jordan give the answer we quoted in note 43. Later in 
the interview, Kuhn explicitly asked him how he felt about the idea of taking two 
components of spin to be conjugate to one another. "Yes, wasn't that neat," Jordan 
answered, "that was a success, wasn't it, that was pretty that that worked, that one 
could do it this way, a sensible generalization of p and q" (session 3, p. 22). As Kuhn 
notes in defense of Jordan, the breakdown of the ordinary commutation relations 
did make room for Jordan's introduction of anti-commutation relations (ibid., p. 
23; cf. Jordan, 1927h; Jordan and Wigner, 1928; and the coda to our paper) 
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All in all, the state of affairs by the end of Neue Begriindung II can fairly 
be characterized as follows. Jordan was still trying to rely on the classical 
formalism of canonical transformations and conjugate variables to build up 
the formalism realizing the postulates for his quantum-mechanical probability 
amplitudes. However, these classical concepts had to be stretched almost be- 
yond the breaking point to arrive at a satisfactory formulation of his quantum 
theory. 



1.2 Mathematical challenges facing the Dirac- Jordan theory 



From a modern point of view, the realization of the axioms of Neue Begriindung 
is supplied by the Hilbert space formalism!^ Jordan's probability amplitudes 
(f(a,b) are then identified with 'inner products' (a\b) of 'eigenvectors' \a) and 
| b) of operators a and b\^_\ The reason we used scare quotes in the preceding 
sentence is that for quantities with completely continuous spectra, to which 
Jordan restricted himself in Neue Begriindung I, the 'eigenvectors' of the cor- 
responding operators are not elements of Hilbert space. That in modern quan- 
tum mechanics they are nonetheless routinely treated as if they are vectors 
in Hilbert space with inner products such as (a\b) is justified by the spectral 
theorem for the relevant operators. 



Of course, neither the spectral theorem nor the notions of an abstract Hilbert 
space and of operators acting in it were available when Jordan and Dirac pub- 
lished their respective versions of transformation theory in 1927. The Hilbert 
sp ace formalism a n d the s pectral theorem were only introduced later that year, 
by Ivon Neumann! (|1927al ). and two more years passed before Ivon Neumann 
( 1929[ ) publ ished a rigorous proof of the spectral theorem. So even though 
Dirad ( 1927 ) introduced the notation (a/b) for what Jordan wrote as Lf(a,b), 



46 As Darrigol ( 19921 . p. 344) notes about Dirac's version of transformation theory: 
"There is one feature of Dirac's original transformation theory that is likely to sur- 
prise the modern quantum physicist: the notion of state vector is completely absent. 
It was in fact introduced later by Weyl [1927] and von Neumann [1927a, see Section 
6], and subsequently adopted by Dirac [1930] himself. In 1939 Dirac even split his 
original transformation symbol (£'/a') into two pieces and \a'), the "bra" and 
the "ket" vectors. The mathematical superiority of the introduction of state vectors 
is obvious, since it allows — albeit not without difficulty — an explicit construction of 
mathematical entities (rigged Hilbert spaces) that justify Dirac's manipulations." 
In a note added in a proof to his next paper (see Section 7) . Ivon Neumann (|l927bl . 
p. 256) acknowledged that Wevl ( 19271 ) had independently introduced the notion of 
a pure state. 

47 We will not introduce a special notation to distinguish between a physical quantity 
and the operator acting in Hilbert space representing that quantity. In most cases 
it will be clear from context whether a stands for a quantity or an operator. 
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Dirac, like Jordan, did not at that time conceive of these quantities as 'inner 
products' of two more elementary quantities (see note 46). Although Dirac 
later did accept the split (once again see note 46), probability amplitudes re- 



mained the fundamental quantities for Jordan (IDuncan and Janssenl . 120091 p. 
361). 

Once the 'inner-product' structure of probability amplitudes is recognized and 
justified with the help of the spectral theorem, Jordan's basic axioms about 
the addition and multiplication of probability amplitudes are seen to reduce 
to statements about orthogonality and completeness familiar from elementary 
quantum mechanics. For instance, as we mentioned in Section 1.1, Jordan's 
postulates demand that the probability amplitudes (p(a, c), ip(a, b) and xipi c ) 
for quantities a, b, and c with purely continuous spectra satisfy the relation 
(p(a,c) = J dbip(a,b) x(b,c). Once probability amplitudes are identified with 
'inner products' of 'eigenvectors' (appropriately normalized, such that, e.g., 
(a\a') = 5(a — a'), where 5(x) is the Dirac delta function), the familiar com- 
pleteness relation, (a|c) = / db (a\b)(b\c) (which holds on account of the reso- 
lution of unity, 1 = J db\b)(b\, corresponding to the spectral decomposition of 
the operator b), guarantees that <p(a, c) — J dbip(a, b) x(b, c). In this sense, the 
Hilbert space formalism thus provides a realization of Jordan's postulates. 

In the absence of the Hilbert space formalism and the spectral theorem, Jordan 
relied on the formalism of canonical transformations to develop the analyti- 
cal apparatus realizing his axiomatic scheme. As we saw in Section 1.1, his 
starting point was the probability amplitude p(p, q) = e~ %pq l h for some gener- 
alized coordinate q and its conjugate momentum pjjjfj This special probability 
amplitude, to reiterate, trivially satisfies two simple differential equations. Jor- 
dan then considered canonical transformations to other canonically conjugate 
variables P and Q and derived differential equations for arbitrary probability 
amplitudes starting from the ones for p(p,q). In this way, he claimed, one 
could recover both the time-independent and the time-dependent Schrodinger 
equations as examples of such equations. 

Both claims are problematic. The recovery of the time-dependent Schrodinger 
equation requires that we look upon the time t not as a parameter as we would 
nowadays but as an operator to be expressed in terms of the operators p and q. 
More importantly, Jordan's construction only gets us to the time-independent 
Schrodinger equation for Hamiltonians with fully continuous spectra. In Neue 
Begriindung I, Jordan deliberately restricted himself to quantities with com- 
pletely continuous spectra, confident at that point that his approach could 
easily be generalized to quantities with wholly or partly discrete spectra. He 
eventually had to accept that this generalization fails. The problem, as he him- 



48 In the parlance of modern quantum information theory, this is the statement that 
{\p}} and {\q)} are mutually unbiased bases. 
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self recognized in Neue Begriindung II, is that two quantities a and p related 
to each other via a canonical transformation (implemented by the similarity 



transformation at = TpT" 1 ) always have the same spectrum] 49 1 Hence, no 
canonical transformation that can be implemented in this way can take us 
from quantities such as p and q with a completely continuous spectrum to a 
Hamiltonian with a wholly or partly discrete spectrum. 

The quantities <p(a, b) in Jordan's formalism do double duty as probability 
amplitudes and as integral kernels of canonical transformations. Even if we 
accept the restriction to quantities with fully continuous spectra for the mo- 
ment, Jordan could not quite get his formalism to work, at least not at the level 
of generality he had hoped for. In hindsight, we can see that another major 
hurdle was that canonical transformations from one set of conjugate variables 
to another, although they do preserve the spectra, do not always preserve the 
Hermitian char acter of the operators assoc iated with these variables in quan- 



tum mechanics (IDuncan and Janssenl . l2009l sees. 5-6). Initially Jordan tried to 



get around this problem through the introduction of the Erganzungsamplitude. 
Once he dropped that notion, he had to restrict the allowed canonical trans- 
formations to those associated with unitary operators. In the modern Hilbert 
space formalism, the integral kernels of canonical transformations in Jordan's 
formalism are replaced by unitary operators. There is no need anymore for con- 
sidering canonical transformations nor, for that matter, for sorting quantities 
into pairs of conjugate variables. Jordan's reliance on canonical transforma- 
tions and conjugate variables became even more strained in Neue Begriindung 
II, when he tried to extend his approach to quantities with partly or wholly 
discrete spectra. He had a particularly hard time dealing with the purely dis- 
crete spectrum of the recently introduced spin observable. Minding his p's and 
g's, Jordan ended up putting himself in a straitjacket. 



1.3 Von Neumann's alternative to the Dirac- Jordan theory 



At the end of their exposition of Jo r dan's theory in Neue Begriindung I 



Hilbert. von Neumann, and Nordheiml ( 119281 . p. 30) emphasized the mathe- 



matical difficulties with Jordan's approach (some of which they had caught, 
some of which they too had missed), announced that they might return to 
these on another occasion, and made a tantalizing reference to the first of 



von 


Neumann 


von 


Neumann! 



of these papers, "Mathematical foundations (Mathematische Begriindung) of 



49 See Eq. (84) at the end of Section 2.3 for a simple proof of this claim in modern 
language. 
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quantum mechanics," is the one in which Ivon Neumann! (11927a ) introduced 
the Hilbert space formalism and the spectral theorem (he only published a 
rigorous proof of the latter two years later). One might therefore expect at 
this juncture that von Neumann would simply make the observations that we 
made in Section 1.2, namely that the Hilbert space formalism provides the 
natural implementation of Jordan's axiomatic scheme and that the spectral 
theorem can be used to address the most glaring mathematical problems with 
this implementation. Von Neumann, however, did nothing of the sort, 



50 



Von Neumann was sharply critical of the Dirac- Jordan transformation theory. 
As he put it in the introduction of his 1932 book: "Dirac's method does not 
meet the demands of mathematical rigor in any way — not even when it is re- 
duced in the natural and chea p way to the level that is common in theoretical 
physics" ( Ivon Neumann! . Il932l . p. 2; our emphasis). He went on to say that "the 
correct formulation is not just a matter of making Dirac 's method mathemati- 
cally precise and explicit but right from the start calls for a different a ppr oach 



related to Hilbert 's spectral theory of operators" (ibid., our emphasis) J 51 1 Von 



50 Von Neumann's formulation of qua ntum mechanics is non etheless often referred 
to as transformation theory. In fact, Ivon Neumannl (|1932l . p. 1) himself, in the 



introduction of his famous book, used that term to describe both his own theo r y and 
the theory of Dirac and Jordan. And the title of the section in which I Jammed (|1966l . 
pp. 307-322) covers von Neumann's formulation is "The statistical transformation 
theory in Hilbe r t spac e." At the beginning of the preceding section on Jordan and 
Dirac, Jammeil (1966, p. 293) warns his readers that "a clear-cut or universally 
accepted definition of the subject matter of the transformation theory is hardly 
found in the literature." After giving a couple of examples of widely different usages, 
he offers a possible definition: "the study of those transformations in quantum theory 
which leave the results of empirically significant formulae invariant" (ibid., p. 294). 
With this definition, there can be many different transformation theories depending 
"on the kind of space with respect to which the performance of transformations is 
to be considered" (ibid.). Jammer then characterizes the progression from Jordan 
and Dirac to von Neumann in a way that fits well with our analysis: "while the 
early phases of the development were characterized by transformations within the 
configuration and momentum space, its later elaboration led to the conception of 
abstract Hilbert spaces as the arena underlying the transformations. In this course 
the formerly important notion of canonical transformations in quantum mechanics 
gradually l ost its pec u liarity and importance" (ibid.). 

51 As Leon van Hove ( 19581 . pp. 95-96), noted in a concise and lucid summary of 
von Neumann's contribution to quantum mechanics for a special issue of the Bul- 
letin of the American Mathematical Society dedicated to his memory: "Although 
von Neumann himself attempted at first, in collaboration with Hilbert and Nord- 
heim, to edify the quantum-mechanical formalism along similar lines [i.e., those 
of the transformation theory of Dirac and Jordan], he soon realized that a much 
more natural framework was provided by the abstract, axiomatic theory of Hilbert 
spaces and their linear operators." Unfortunately, there are many highly misleading 
statements in the historical literature about the relation between the Dirac-Jordan 
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Neumann only referred to Dirac in this passage, but as co-author of the paper 
with Hilbert and Nordheim mentioned above, he was thoroughly familiar with 
Jordan's closely related work as well. He also clearly appreciated the differ- 
ence in emphasis between Dirac and Jordan. Talking about the Schrodinger 
wave function in the introduction of the second paper of his 1927 trilogy, he 
wrote: "Dirac interprets it as a row of a certain transforma tion matrix, Jor- 
dan calls it a probability amplitude" f lvon Neumann! . Il927bl p. 246) J 52 1 In the 



opening paragraph of this article, von Neumann contrasted wave mechanics 
with "transformation theory" or "statistical theory," once again reflecting the 
difference in emphasis between Dirac and Jordan. Yet, despite his thorough 
understanding of it, von Neumann did not care for the Dirac- Jordan approach. 

Von Neumann's best-known objection concerns the inevitable use of delta 
functions in the Dirac- Jordan approach. However, von Neumann also ob- 
jected to the use of probability amplitudes. Jordan's basic amplitude, p(p, q) = 
e~ ipg / h , is not in the space L 2 of square-integrable functions that forms one 
instantiation of abstract Hilbert space. Moreover, probability amplitudes are 
only determined up to a phase factor, which von Neumann thought particu- 
larly unsatisfactory. "It is true that the probabilities appearing as end results 
are invariant," he granted in the introduction of his paper, "but it is un- 
satisfactory and unclea r why this detour thro ugh the unobservable and non- 
invariant is necessary" (jvon Neumann! . Il927al . p. 3). So, rather than following 
the Jordan-Dirac approach and looking for ways to mend its mathematical 
shortcomings, von Neumann, as indicated in the passage from his 1932 book 
quoted above, adopted an entirely new approach. He generalized Hilbert's 
spectral theory of operatora 53 1 to provide a formalism for quantum mechanics 
that is very different from the one proposed by Jordan and Dirac. 

The only elements that von Neumann took from transformation theory — more 
specifically Jordan's version of it — were, first, Jordan's basic idea that quan- 
tum mechanics is ultimately a set of rules for conditional probabilities Pr(a|6), 
and second, the fundamental assumption that such probabilities are given by 
the absolute square of the corresponding probability amplitudes, which essen- 



transformation theory and von Neumann's Hilbert space formalism. iKraghl (ll990l . 
pp. 46-47) calls the latter "a m athematically advanced development of the Dirac- 
Jordan transformation theory." lLackil (|2000l . p. 301) writes that "von Neumann 
appears to follow exactly the program as set out in his previous paper with Hilbert 
and Nordheim." And in our paper on the path to Neue Begriindung, we wrote: "In 
the process of providing sound mathematical underpinnings of Jordan's transforma- 
tion theory, von Neumann introduced the i dea of representing quantum -mechanical 
states by vectors or rays in Hilbert space" ( Duncan and Janssen . 20091. p . 361). 
52 Discussing Jordan's approach in his first paper, von Neumann ( 1927al ) referred 
to "transformation operators (the integral kernels of which are the "probability 
ampli tudes" ) " fp. 3). 

See Steen (1 19731 ) for a brief history of spectral theory. 
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tially boils down to the Born rulejj^j Von Neumann derived a new expression 
for conditional probabilities in quantum mechanics that avoids probability 
amplitudes altogether and instead sets them equal to the trace of products of 
projection operators, as they are now called. Instead of the term 'projection 
operator', Von Neumann used the term Einzeloperator (or E.Op. for short; 
cf. note 116). The probability Pr(a|6), e.g., is given by Tr (E (a) F(b)), where 
E(a) and F(b) are projection operators onto, in Dirac notation, the 'eigen- 
vectors' | a) and \b) of the operators a and b, respectively. Unlike probability 
amplitudes, these projection operators do not have any phase ambiguity. This 
is easily seen in Dirac notation. The projection operator E(a) = \a){a\ does 
not change if the ket \a) is replaced by e J,? |a) and the bra (a\ accordingly by 
e (a\. We should emphasize, however, that, just as Jordan and Dirac with 
their probability amplitudes/transformation functions (a\b), von Neumann did 
not think of his projection operators as constructed out of bras and kets, thus 
avoiding the problem that many of these bras and kets are not in Hilbert 
space. 



Toward the end of his paper, Ivon Neumann! (11927al . pp. 46-47) noted that 



his trace expression for conditional probabilities is invariant under "canoni- 
cal transformations." What von Neumann called canonical transformations, 
however, are not Jordan's canonical transformations but simply, in modern 
terms, unitary transformations. Such transformations automatically preserve 
Hermiticity and the need for something like Jordan's Erganzungsamplitude 
simply never arises. Von Neumann noted that his trace expression for con- 
ditional probabilities does not change if the projection operators E and F 
are replaced by UEW and UFW, where U is an arbitrary unitary operator 
(W = U' 1 ). In von Neumann's approach, as becomes particularly clear in 
his second paper of 1927 (see below), one also does not have to worry about 
sorting variables into sets of mutually conjugate ones. This then is what the 
'never mind your p's and g's' part of the title of our paper refers to. By avoiding 
conjugate variables and canonical transformations, von Neumann completely 
steered clear of the problem that ultimately defeated Jordan's attempt to de- 
rive all of quantum mechanics from his set of axioms, namely that canonical 
transformations can never get us from p's and g's with fully continuous spectra 
to quantities with wholly or partly discrete spectra, such as the Hamiltonian. 

In Mathematische Begriindung, von Neumann not only provided an alternative 
to Jordan's analysis of probabilities in quantum mechanics, he also provided 
an alternative to the Dirac- Jordan transformation-theory ap proach to prov- 



ing the equivalence of matrix mechanics and wave mechanics (jvon Neumann 



54 Interestingly. Ivon Neumann! ( 1927al . pp. 43-44) mentioned Pauli, Dirac, and Jor- 
dan in this context, but not Born. Unlike Heisenberg (see note 10), von Neumann 
had no reason to dislike Born's work. A nd he did cite Born in the second paper of 



his 1927 trilogy on quantum mechanics ( von Neumann . 1927bl . p. 245). 
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1927al . p. 14). This is where von Neumann put the abstract notion of Hilbert 
space that he introduced in his paper to good use. He showed that matrix 
mechanics and wave mechanics correspond to two instantiations of abstract 
Hilbert space, the space I 2 of square- summ able sequen c es an d the space L 2 



of square-integrable functions, respectively (iDieudonnel . Il98ll . p. 172) J I As 



von Neumann reminded his readers, well-known theorems due to Parseval and 
Riesz and Fisher had established that I 2 and L 2 are isomorphic, 



5l) 



In his second 1927 paper, "Probability-theoretical con struction ( Wahrschein 



lichkeitstheoretischer Aufbau) of quantum mechanics," Ivon Neumann! (jl927bl ) 
freed himself even further from relying on the Dirac- Jordan approach. In 
Mathematische Begriindung he had accepted the Born rule and recast it in 
the form of his trace formula. In Wahrscheinlichkeitstheoretischer Aufbau he 
sought to derive this trace formula, and thereby the Born rule, from more 
fundamental assumptions about probability. Drawing on ideas of von Mises 
(see note 134), von Neumann started by introducing probabilities in terms 
of selecting members from large ensembles of systems. He then made two 
very general and prima facie perfectly plausible as sumptions about expe cta- 
tion values of quantities defined on such ensembles (jvon Neumann! Il927bl . pp. 
246-250). From t hose assumptions, som e assumptions about the repeatability 
of measurements ( von Neumann . 1927bi p. 271, p. 262, cf. note 137), and key 
features of his Hilbert space formalism (especially some assumptions about the 
association of observables with Hermitian operators), von Neumann did indeed 
manage to recover the Born rule. Admittedly, the assumptions needed for this 
result are not as innocuous as they look at first sight. They are essentially the 
same as those that go into von Neumann's infamous no -hidden- variable proof 
fTBell . Il966l : iBacciagaluppi and Crulll . 120091 : liubl l2010h . 



Along the way Ivon Neumann! ( )1927bl . p. 253) introduced what we now call a 
density operator to characterize the ensemble of systems he considered. He 
found that the expectation value of an observable represented by some opera- 
tor a in an ensemble characterized by p is given by Tr(pa), where we used the 
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Not usually given to hyperbole, Jammer ( 19661 . p. 316) was moved to comment 
on this feat: "In von Neumann's formulation of quantum mechanics was found the 
ultimate fulfillment — as far as the theory of quanta is concerned — of Hipparchus's 
insistence on the usefulness of investigating "why on two hypotheses so different 
from one another . . . the same results appear to follow," which, as we have seen, 
had already characterized Jordan's line of research." The references to Hipparchus 
and Jordan are to p. 307 of Jammer's book (see also p. 293, where Dirac is mentioned 
as well). 

56 In 1907-1908, Erhard Schmidt, a student of Hilbert who g ot his Ph.D. in 1905, 
fully worked out the theory of I 2 and called it 'H ilbert space' (SteerJ . 19731 . p. 364). 



1926bl . p. 197) used the 



In a paper on canonical transformations, Fritz lLondon 

term 'Hilbert space' for L 2 (Jammer, 1966, p. 298; Duncan and Janssen 2009, p. 
356, note 12). 
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modern notation p for the densi ty operator (vo n Neum ann used the letter U). 
This result holds both for what Ivon Neumann! (jl927bl ) called a "pure" (rein) 
or "uniform" (einheitlich) ensemble (p. 255), consisting of identical systems 
in identical states, and for what he called a "mixture" (Gemisch) (p. 265). 
So the result is more general than the Born rule, which obtains only in the 
former case. Von Neumann went on to show that the density operator for a 
uniform ensemble is just the projection operator onto the ray in Hilbert space 
corresponding to the state of all systems in this ensemble. However, he found 
it unsatisfactory to characterize the state of a physic al system by specifying 
a ray in Hilbert space. "Our knowledge of a system," Ivon Neumann! (11927b . 
p. 260) wrote, "is never described by the specification of a state . . . but, as 
a rule, by the results of experiments performed on the system." In this spirit, 
he considered the simultaneous measurement of a maximal set of commuting 
operators and constructed the density operator for an ensemble where what 
is known is that the corresponding quantities have values in certain intervals. 
He showed that such measurements can fully determine the state and that the 
density operator in that case is once again the projection operator onto the 
corresponding ray in Hilbert space. 



Von Neumann thus arrived at the typical quantum-mechanical way of con- 
ceiving of a physical problem nowadays, which is very different from the clas- 
sical way to which Jordan was still wedded in Neue Begriindung. In classical 
mechanics, as well as in Jordan's version of transformation theory, the full 
description of a physical system requires the specification of a complete set of 
p's and q's. In quantum mechanics, as was first made clear in von Neumann's 
Wahrscheinlichkeitstheoretischer Aufbau, it requires the specification of the 
eigenvalues of all the operators in a maximal set of commuting operators for 
the system. In other words, the 'never mind your p's and g's' part of the title 
of our paper carried the day. 



l.Jf. Outline of our paper 



In the balance of this paper we cover the contributions of Jordan and von 
Neumann (initially with Hilbert and Nordheim) to the developments sketched 
above in greater detail. We give largely self-contained reconstructions of the 
central arguments and derivations in five key papers written in Gottingen and, 

in one case (Neue Begriindung II), in Copenha g en over the span of just one 

year, from late 1926 t o late 1927 (|Jordaij . ll927bl Ja; iHilbert. von Neumann, and Nordheiml . 
1928 ; von Neumann . 1927al b). To make the arguments and derivations in these 



papers easier to follow for a modern reader, we translate them all into the kind 
of modern notation introduced in Sections 1.2 and 1.3. To make it easier for 
the reader to check our claims against the primary sources, we provide detailed 
references to the latter, including where necessary equation numbers and leg- 
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ends for the original notation. We will not cover Dirac (cf. note 6), although 
we will occa sionally refe r to his work, both his orig inal paper o n transforma- 
tion theory (Dirac, 1927 ) and the book based on it ( Dirac . 1930l ). We will also 
freely avail ourselves of his bra and ket notation. Our focus, however, will be 
on Jordan and von Neumann. 



We begin, in Section 2, with Neue Begriindung I f|Jordanl . ll927bl ). In this paper 
Jordan only dealt with quantities with completely continuous spectra, sug- 
gesting that the generaliza tion to ones wi th partly or wholly discrete spectra 
would be straightforward (Jordan, 1927bl . p. 811, p. 816). We cover Jordan's 
postulates for his probability amplitudes (Section 2.1) and his construction of 
a realization of these postulates, especially his use of canonical transforma- 
tions between pairs of conjugate variables to derive the differential equations 
for these amplitudes ( S ection 2.3). In Section 2.2, drawing on an earlier paper 
( iDuncan and Janssenl . 120091 ) . we remind the reader of the role of canonical 
transformations in matrix mechanics. In Section 2.4, we take a closer look at 
Jordan's notion of a supplementary amplitude [Erganzungsamplitude] . 



In Sec tion 3, we discuss the paper by iHilbert. von Neumann, and Nordheim 



( 119281 ). submitted in April 1927, that grew out of the expositi on of Jordan's ap- 
proac h in Hilbert's 1926/1927 course on quantum mechanics (ISauer and Majer 



2009 



pp. 698-706). Hil bert and his co-authors had the advantage of having 
read the paper in which iDirad (119271 ) presented his version of transformation 
theory. Jordan only rea d Dirac's paper when he was correcting the page proofs 



of Neue Begriindung I ( 1 Jordan! Il927bl . p. 809; note added in proof). 



In Section 4, we consider Neue Begriindung II (jJordanl . Il927gf ). received by 
Zeitschrift fiir Physik in early June 1927 and written in part in response 
to crit icism o f Neue Begriindung I by IHilbert. von Neumann, and Nordheim 
( 119281 ) and by lvon Neumann! ( 11927a! ) in Mathematische Begriindung. Since von 
Neumann introduced an entirely new approach, we deviate slightly from the 
chronological order of these papers, and discuss Mathe matisch e Begrii ndung 
after Neue Begriindung II. In the abstract of the latter, iJordanl (Il927gl . p. 1) 
promised "a simplified and general ized p r esenta tion of the theory developed in 
[Neue Begriindung] I." Drawing on lDirad (119271 ). Jordan simplified his notation 
somewhat, although he also added some new and redundant elements to it. 
Most importantly, however, the crucial generalization to quantities with partly 
or wholly discrete spectra turned out to be far more problematic than he had 
suggested in Neue Begriindung I. Rather than covering Neue Begriindung II 
in detail, we highlight the problems Jordan ran into, especially in his attempt 
to deal with spin in his new formalism. 



In Sections 5 and 6, we turn to the first two papers of von Neumann's trilogy 
on quantum m e chanics of 1927. In Section 5, on Mathematische Begriindung 
( Ivon Neumann! . Il927al ). we focus on von Neumann's criticism of the Dirac- 
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Jordan transformation theory, his proof of the equivalence of wave mechanics 
and matrix mechanics based on the isomorphism between L 2 and I 2 , and his 
derivation of the trace formula for probabilities in quantum mechanics. We 
do not cover the introduction of his Hilbert space formalism, which takes 
up a large portion of his paper. This material is covered in any number of 



modern books on fun ctional analysis! I In S ection 6, on Wahrscheinlichkeits- 



theoretischer Aufbau (jvon Neumann! . Il927d ). we likewise focus on the overall 



argument of the paper, covering the derivation of the trace formula from some 
basic assumptions about the expectation value of observables in an ensemble of 
identical systems, the introduction of density operators, and the specification 
of pure states through the values of a maximal set of commuting operators. 

In Section 7, we summarize the transition from Jordan's quantum-mechanical 
formalism rooted in classical mechanics (mind your p's and g's) to von Neu- 
mann's quantum-mechanical formalism which no longer depends on classical 
mechanics for its formulation (never mind your p's and g's). 

As a coda to our story, we draw attention to the reemergence of the canonical 
formalism, its generalized coordinates and conjugate momenta, even for spin-| 
particles, in quantum field theory. 



2 Jordan's Neue Begriindung I (December 1926) 



Neue Begriindung I was received by Zeitschrift fur Physik on December 18, 
1926 and published January 18, 1927 f|jordarJ . [T927bl l It consists of two parts. 
In Part One (I. Teil), consisting of sees. 1-2 (pp. 809-816), Jordan laid down 
the postulates of his theory. In Part Two (II. Teil), consisting of sees. 3-7 
(pp. 816-838), he presented the formalism realizing these postulates. In the 
abstract of the paper, Jordan announced that his new theory would unify all 
earlier formulations of quantum theory: 



The four forms of quantum mechanics that have been developed so far — 
matrix theory, the theory of Born and Wiener, wave mechanics, and q- 
number theory — are contained in a more general formal theory. Following 
one of Pauli's ideas, one can base this new theory on a fe w simple funda- 
mental postulates (Grundpostulate) of a statistical nature (j Jordan! . Il927b . 
p. 809). 



As we mentioned in Section 1.2, Jordan claimed that he could recover both 
the time-dependent and the time-independent Schrodinger equation as special 



57 See, e.g., iPrugoveckil ( 19811 ). or, for a more elementary treatment, w hich will be 
more than sufficient to follow our paper. iDennerv and Krzvwickil (]1996l . Ch. 3). 
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cases of the differential equations he derived for the probability amplitudes 
central to his formalism. This is the basis for his claim that wave mechanics 
can be subsumed under his new formalism. Nowhere in the paper did he show 
explicitly how matrix mechanics is to be subsumed under the new formalism. 
Perhaps Jordan felt that this did not require a special argument as the new 
formalism ha d grown natura lly out of matrix mechanics and his own contri- 
butions to it (Jordan! . ll926aU bl). However, as emphasized repeatedly already, 
Jordan! (Il927bl ) restricted himself to quantities with purely continuous spectra 
in Neue Begriindung I, so the formalism as it stands is not applicable to ma- 
trix mechanics. Like Dirac's (1927) own version of statistical transformation 
theory, Jordan's version can be seen as a natural extension of Dirac's (1925) 
g-number theory. It is only towar d the end of his paper ( sec. 6) that Jordan 
turned to the operator theory of iBorn and Wiener! (119261 ). In our discussion 
of Neue Begriindung I, we omit this section along with some mathematically 
intricate parts of sees. 3 and 5 that are not necessary for understanding the 
paper's overall argument. We do not cover the concluding sec. 7 of Jordan's 
paper either, which deals with quantum jumps (recall his earlier paper on this 
topic [Jordan, 1927a], which we briefly discussed in Section 1.1). 



Although we will not cover Jordan's unification of the various forms of quan- 
tum theory in any detail, we will cover (in Section 5) von Neumann's criticism 
of the Dirac- Jordan way of proving the equivalence of matrix mechanics and 
wave m echanics as a p r elude t o his own proof based on the isomorphism of I 2 
and L 2 fjvon Neumann! . 1 1927al ). In our discussion of Neue Begriindung I in this 
section, we focus on the portion of Jordan's paper that corresponds to the last 
sentence of the abstract, which promises a statistical foundation of quantum 
mechanics. Laying this foundation actually takes up most of the paper (sees. 
1-2, 4-5). 



2.1 Jordan's postulates for probability amplitudes 



The central quantities in Neue Begriindung I are generalizations of Schrodinger 
energy eigenfunctions which Jordan called "probability amplitudes." He at- 
tributed both the generalization an d the t e rm to Pauli. Jordan referred to a 
footnote in a forthcoming paper by iPaulil (11927al . p. 83, note) proposing, in 
Jordan's terms, the following interpretation of the energy eigenfunctions <p n {q) 
(where n labels the different energy eigenvalues) of a system (in one dimen- 
sion): "If <p n (q) is normalized, then \<p n (q)\ 2 dq gives the probability that, if the 
system is in the state n, the coordinate [q] has a value between q and q + dq" 
( Jordan . 1927bl . p. 811). A probability amplitude such as this one for position 
and energy can be introduced for any two quantities. 



In Neue Begriindung I, Jordan focused on quantities with completely con- 
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tinuous spectra. He only tried to extend his approach, with severely limited 
success, to partly or wholly discrete spectra in Neue Begriindung II (see Section 
4). For two quantities x and y that can take on a continuous range of values 
x and y, respectively]^] there is a complex probability amplitude ip(x, y) such 
that \<p(x, y)\ 2 dx gives the probability that x has a value between x and x + dx 
given that y has the value y. 

In modern Dirac notation (p(x, y) would be written as (x\y) (cf. our discussion 
in Section 1.2). Upon translation into this modern notation, many of Jordan's 
expressions turn into instantly recognizable expressions in modern quantum 
mechanics and we will frequently provide such translations to make it easier 
to read Jordan's text. We must be careful, however, not to read too much into 
it. First of all, von Neumann had not yet introduced the abstract notion of 
Hilbert space when Jordan and Dirac published their theories in early 1927, so 
neither one thought of probability amplitudes as 'inner products' of 'vectors' 
in Hilbert space at the time. More importantly, for quantities x's and y's 
with purely continuous spectra (e.g., position or momentum of a particle in 
an infinitely extended region), the 'vectors' \x) and \y) are not elements of 
Hilbert space, although an inner product (x\y) can be defined in a generalized 
sense (as a distribution) as an integral of products of continuum normalized 
wave functions, as is routinely done in elementary quantum mechanics. That 
continuum eigenstates can be treated as though they are indeed states in 
a linear space satisfying completeness and orthogonality relations which are 
continuum analogs of the discrete ones which hold rigorously in a Hilbert space 
is, as we will see later, just the von Neumann spectral theorem for self-adjoint 
operators with a (partly or wholly) continuous spectrum. 



In the introductory section of Neue Begriindung I, iJordanl (Il927bl . p. 811) 
listed two postulates, labeled I and II. Only two pages later, in sec. 2, enti- 
tled "Statistical foundation of quantum mechanics," these two postulates are 



superseded by a new set of fou r postulates, labeled A through Dl 59 1 In Neue 



Begriindung II, IJordanl (jl927gl p. 6) presented yet another set of postulates 



three this time, labeled I through III (see Section 4)[£^J The exposition of 



58 Recall that this is our notation (cf. note 27): Jordan used different letters for 
quantities and their numerical values. For instance, he used q (with val ue x) and (3 
(with value y) for what we called x and y, respectively ( Jordan . 1927bl . p. 813) 

59 In the short versio n of Ne u e Begr iindung I presented to the Gottingen Academy 
on January 14, 192 7. IJordanl (ll927d . p. 162) only introduced postulates I and II. In 
this short version, IJordanl (ll927d . p. 163) referred to "a soon to appear extensive 
paper in Zeitschrift fiir Physik" (i.e., Jordan, 1927b). 

60 In his ove rview of recent developments in quantum mechanics in Die Naturwis- 



senschaften, IJordanl (|1927il . Pt. 2, p. 648), after explaining the basic notion of a 
probability amplitude (cf. Postulate A below), listed only two postulates, or ax- 
ioms as he now called them, namely "the assumption of probability interference" 
(cf. Postulate C below) and the requirement that there is a canonically conjugate 
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Jordan's theory by iHilbert. von Neumann, and Nordheiml ( 119281 ) . written in 



between Neue Begriindung I and II, starts from six "physical axioms" (pp. 
4-5), labeled I through VI (see Section 3). We will start from Jordan's four 
postulates of Neue Begriindung I, which we paraphrase and comment on below, 
staying close to Jordan's own text but using the notation introduced above to 
distinguish between quantities and their numerical values. 

Postulate A. For two mechanical quantities q and (3 that stand in a definite 
kinematical relation to one another there are two complex-valued functions, 
ip(q, (3) and ip(q, (3), such that <p(q, (3)ip*(q, (3)dq gives the probability of finding 
a value between q and q + dq for q given that j3 has the value (3. The function 
ip(q, j3) is called the probability amplitude, the function ijj(q,(3) is called the 
"supplementary amplitude" [Erg dnzungs amplitude) . 

Comments: As becomes clear later on in the paper, "mechanical quantities 
that stand in a definite kinematical relation to one another" are quantities 
that can be written as functions of some set of gene ralized c oordina tes and 
their conjugate momenta. In his original postulate I, Ijordanl (Il927bl p. 162) 



wrote that "<p(q, (3) is independent of the mechanical nature (the Hamiltonian) 
of the system and is determined only by the kinematical relation between q 
and /3" (hats added). Hilbert et al. made this into a separate postulate, their 
axiom V: "A further physical requirement is that the probabilities only de- 
pend on the functional nature of the quantities Fi(pq) and F 2 (pq), i.e., on their 
kinematical connection [ Verkniipfung] , and not for instance on additional spe- 
cial properties of the mec hanical system under consideration, su c h as, for ex- 



ample, its Hamiltonian" (IHilbert. von Neumann, and Nordheiml . Il928l . p. 5). 



With <p(q,f3) = (q\(3), the statement about the kinematical nature of proba- 
bility amplitudes translates into the observation that they depend only on the 
inner-product structure of Hilbert space and not on the Hamiltonian governing 
the time evolution of the system under consideration, 
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It turns out that for all quantities represented, in modern terms, by Her- 
mitian operators, the amplitudes ip(q, (3) and ip(q, (3) are equal to one an- 
other. At this point, however, Jordan wanted to leave room for quantities 
represented by non-Hermitian operators. This is directly related to the cen- 
tral role of canonical transformations in his formalism. As Jordan (1926a,b) 
had found in a pair of papers published in 1926, canonical transformations 
need not be unitary and therefore do not always preserve the Hermiticity 



of the conjugate variables one starts from (IDuncan and Janssenl . 120091 ). The 



Erganzungsamplitude does not appear in the presentation of Jordan's formal- 



quantity p for every quantum-mechanical quantity q (cf. Postulate D below). 
61 In the AHQP interview with Jordan, Kuhn emphasized the importance of this 
aspect of Jordan's formalism: "The terribly important step here is throwing the 
particular Hamiltonian function away and saying that the relationship is only in 
the kinematics" (session 3, p. 15). 
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is m bvlHilbert. von Neumann, and Nordheiml ( 119281 ) J 62 1 In Neue Begriindung 
II, iJordanl ( 1l927gl . p. 3) restricted himself to Hermitian quantities and silently 
dropped the Erganzungsamplitude. We return to the Erganzungsamplitude in 
Section 2.4 below, but until then we will simply set ijj{q, (3) = ip(q, (3) every- 
where. 



Postulate B. The probability amplitude (p(/3, q) is the complex conjugate of the 
probability amplitude cp(q, (3). In other words, <p(/3, q) = <p*(q, 0). This implies 
a symmetry property of the probabilities themselves: the probability density 
\<p((3, q)\ 2 for finding the value f3 for {3 given the value q for q is equal to the 
probability density \<p(q, (3)\ 2 for finding the value q for q given the value (3 for 
(3. 



Comment. This property is immediately obvious once we write tp(q, /?) as 
(q\(3) with the interpretation of (q\f3) as an 'inner product' in Hilbert space 
(but recall that one has to be cautious when dealing with quantities with 
continuous spectra). 



Postula te C. The probabilities combine through interference. In sec. 1. 1 Jordan 



( Il927bl . p. 812) already introduced the phrase "interference of probabilities" 
to capture the striking feature in his quantum formalism that the probability 
amplitudes rather than the probabilities themselves follow the usual compo- 
sition rules for probabilitiesJ 63 1 Let F\ and F2 be two outcomes [Tatsachen] 
for which the amplitudes are ipi and ip 2 . If F\ and F2 are mutually exclu- 
sive, ipi + ip 2 is the amplitude for the outcome l F\ or F 2 . If F\ and F 2 are 
independent, fif 2 is the amplitude for the outcome 'Fi and F 2 . 



Consequence. Let <p(q, 0) be the probability amplitude for the outcome Fx of 
finding the value q for q given the value (3 for (3. Let x{Qi q) be the probability 
amplitude for the outcome F 2 of finding the value Q for Q given the value 
q for q. Since F\ and F 2 are independent, Jordan's multiplication rule tells 
us that the probability amplitude for l F\ and F 2 is given by the product 
x(Q, q) <p(q, (3)- Now let f3) be the probability amplitude for the outcome 
F 3 of finding the value Q for Q given the value [3 for [3. According to Jordan's 
addition rule, this amplitude is equal to the 'sum' of the amplitudes for L F 1 
and F 2 for all different values of q. Since q has a continuous spectrum, this 
'sum' is actually an integral. The probability amplitude for F 3 is thus given 



62 Both ip and if are introduced in the lectures by Hilbert on which this paper is 
based but they are set equal to one another almost immediately and without any 
further explanation ( Sauer and Maier . 20091 . p. 700). 

63 Recall, however, Heisenberg ' s criti cism of this aspect of Jordan's work in his 
uncertainty paper (jHeisenberei . Il927bl . pp. 183-184; cf. note 38). 
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HQ, P) = J X(Q, q) <f(q, P) dq. [NBl, sec. 2, Eq. 14] (1) 



Spec ial case. If Q = /3, the amplitude becomes the Dirac delta func- 

tion. IJordanl fll927bl . p. 814) introduced the notation Sppn even though (3' and 
0" are continuous rather than discrete variables [NBl, sec. 2, Eq. 16]. In a 
footnote he conceded th at this is mathematically dubious. In Ne u e Beq run- 
dung II, IJordanl (Il927gl . p. 5) used the delta function that iDirad (119271 . pp. 



625-627) had meanwhile introduced in his paper on transformation theory. 
Here and in what follows we will give Jordan the benefit of the doubt and 
assume the normal properties of the delta function 



<>.-> 



Using that the amplitude %(/?', q) is just the complex conjugate of the ampli- 
tude ip(q,(3'), we arrive at the following expression for $(/3',/3"): 

$(/?', 0") = J tp*(q, (3') tp(q, (3") dq = S^n. [NBl, sec. 2, Eqs. 15, 16, 17] (2) 



Comment. Translating Eqs. (l)-(2) above into Dirac notation, we recognize 
them as familiar completeness and orthogonality relations 



(Q\P)= (Q\q){q\P)dq, W) = / C^lff) W) dq = 6(J? - /?")■ (3) 



Since the eigenvectors \q) of the operator q a re not in Hilbert space, the spec- 
tral theorem, first proven by lvon Neumann! (I1927al ). is required for the use of 
the resolution of the unit operator 1 = / dq\q)(q\. 

Postulate D. For every q there is a conjugate momentum p. Before stating this 
postulate, Jordan offered a new definition of what it means for p to be the 
conjugate momentum of q. If the amplitude p(p, q) of finding the value p for 
p given the value q for q is given by 



p(p, q) 



-ipq/h 



[NBl, sec. 2, Eq. 18] 



(4) 



then p is the conjugate momentum of q. 



64 We include the numbers of the more important equat ions in t he orig inal papers 
in square brackets. 'NBl' refers to Neue Begriindung I ()jordaiJ . [i^27bn . Since the 
numbering of equations starts over a few times in this paper (see note 79), we will 
often include the section number as well. 

35 For a brief histor y of the delta function that focuses on its role in quantum 



mechanics, see, e.g., Jammer ( 19661 . pp. 301-302, pp. 313-314). 



The notation (Q\(3) for $>(Q,/3) etc. obviates the need for different letters for 
different probability amplitudes that plagues Jordan's notation. 
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Anticipating; a special case of the uncertainty principle (cf. notes 36 and 44), 



Jordan! ( ]1927bl . p. 814) noted that Eq. (4) implies that "[f]or a given value of 



q all possible values of p are equally probable." 

For p's and q's with completely continuous spectra, Jordan's definition of when 
p is conjugate to q is equivalent to the standard one that the operators p and q 
satisfy the commutation relation [p, q] = pq—qp = h/i. This equivalence, how- 
ever, presupposes the usual association of the differential operators (h/i) d/dq 
and 'multiplication by q' with the quantities p and q, respectively. As we em- 
phasized in Section 1.2, Jordan did not think of these quantities as operators 
acting in an abstract Hilbert space, but he did associate them (as well as 
any other quantity obtained through adding and multiplying p's and g's) with 
the differential operators (h/i) d/dq and q (and combinations of them). The 
manipulations in Eqs. (19ab)-(24) of Neue Begriindung I, presented under the 
subheading "Consequences" (Folgerungen) immediately following post ulate D 



are me ant to show that this association follows from his postulates (IJordan 




1927bl . pp. 814-815). Using modern notation, we reconstruct Jordan's rather 
convoluted argument. As we will see, the argument as it stands does not work, 
but a slightly amended version of it does. 

The probability amplitude (p\q) = e~ ipg ^ h , Jordan's p(p,q), trivially satisfies 
the following pair of equations: 



0, [NBl, sec. 2, Eq. 19a] (5) 



(-w- + 1 1 (Pl?> = 0- t NB1 > sec - 2 > E q- 19b l ( 6 ) 
\i op ) 

Unless we explicitly say otherwise, expressions such as (a\b) are to be inter- 
preted as our notation for Jordan's probability amplitudes <p(a, b) and not as 
inner products of vectors \a) and \b) in Hilbert space. 

Following Jordan (NBl, sec. 2, Eqs. 20-22), we define the map T, which takes 
functions / of p and turns them into functions Tf of Q (the value of a new 
quantity Q with a fully continuous spectrum): 

T: f(p) -> (Tf)(Q)=J(Q\p)f(p)dp. (7) 

In other words, T . . . = J dp (Q\p) . . . [NBl, Eq. 21]. For the special case that 
f(p) = (p\q), we get: 



(T(p\q)) (Q) = J (Q\p)(p\q)dp = (Q\q), (8) 
where we used completeness, one of the consequences of Jordan's postulate C 
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(cf. Eqs. (l)-(3)). In other words, T maps (p\q) onto (Q\q) 

(Q\q) = T (p\q). [NBl, sec. 2, Eq. 22] (9) 

Likewise, we define the inverse map T _1 , which takes functions F of Q and 
turns them into functions T~ l F of pj 68 

T- 1 : F(Q) -+ (T- 1 F)(p)=j{p\Q)F(Q)dQ. (10) 



In other words, T 1 . . . = / dQ (p\Q) . . L^ll For the special case that F(Q) = 
(Q\q) we get (again, by completeness): 

T- l {Q\q)) (p) = J ( P \Q}(Q\q)dQ = (p\q), (11) 
or, more succinctly, 

(p|?>=T- 1 <g| g >. (12) 
Applying T to the left-hand side of Eq. (5) [NBl, Eq. 19a], we find: 

t((p+ ~] (p\q)) = Tp(p\q) + ~T<p|g> = 0, (13) 

where we used that differentiation with respect to q commutes with applying 
T (which only affects the functional dependence on p). Using that (p\q) = 
T~Hg\q) (Eq. (12)) and T(p\q) = (Q\q) (Eq. (9)), we can rewrite Eq. (13) 

as: 
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(rpT" 1 + j-^j (Q\q) = 0. [NBl, sec. 2, Eq. 23a] 



(14) 



67 At this point, Jordan's notation, ip(x,y) = T.p{x,y) [NBl, sec. 2, Eq. 22], gets 
particularly confusing as the x on the left-hand side and the x on the right-hand 
side refer to values of different quantities. The same is true for the equations that 
follow [NBl, Eqs. 23ab, 24ab]. 

68 To verify that T~ l is indeed the inverse of T, we take F(Q) in Eq. (10) to be 
(Tf)(Q) in Eq. (7). In that case we get: 



(T~ 1 Tf)(p) = J (p\Q) (| (Q\p')f( P ')dp^ dQ 
(p\Q)(Q\p)f(p)dQdp' 
(p\p')f(p')dp' = f(p), 



w here we used th e resolution of unity, 1 = J dQ \Q)(Q\, and (p\p') = 5(p — p'). 
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Jordan! (jl927bl . p. 815, note) used the Erg anzung Samplitude to represent T _1 in 
this form. 

70 There is a sign error in NBl, sec. 2, Eq. 23a: —TxT" 1 should be TxT^ 1 . 
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Similarly, applying T to the left-hand side of Eq. (6) [NBl, Eq. 19b], we find: 

T + q ) {plq) ) = T ^ {plq) + qT{plq) = °' (15) 

where we used that multiplication by q commutes with applying T. Once again 
using that (p\q) = T^ 1 (Q\q) and T(p\q) = (Q\q), we can rewrite this asi 71 

T-—T- 1 + q)(Q\q) = 0. [NBl, sec. 2, Eq. 23b] (16) 



Eqs. (14) and (16) [NBl, Eqs. 23ab] gave Jordan a representation of the quanti- 
ties p and q in the Q-basis. The identification of p in the Q-basis is straightfor- 
ward. The quantity p in Eq. (5) [NBl, Eq. 19a] turns into the quantity TpT~ x 
in Eq. (14), [NBl, Eq. 23a]. This is just what Jordan had come to expect on 
the basis of his earlier use of canonical transformations (see Section 2.2 below). 
The identification of q in the Q-basis is a little trickier. Eq. (6) [NBl, Eq. 19b] 
told Jordan that the position operator in the original p-basis is —(h/i) d/dp 
(note the minus sign). This quantity turns into — T (h/i) d/dpT" 1 in Eq. (16) 
[NBl, Eq. 23b]. This then should be the representation of q in the new Q-basis, 
as Jordan stated right below this last equation: "With respect to (in Bezug 
auf) the fixed chosen quantity \Q) every other quantit y [a ] corresponds to an 



operator [-T (h/i) d/dp T' 1 }" flJordanl . Il927bl . p. 815) 



72 



71 There is a sign error in NBl, sec. 2, Eq. 23b: — y should be y. The sign errors in 
Eqs. (14)-(16) confused Heisenberg, who was relying on Neue Begriindung I for the 
mathematical part of his uncertainty paper (see note 36). He wrote to Jordan to 
ask for clarification: 



Today just a quick question, since I have been trying in vain, enduring persistent 
fits of rage, to derive your Eq. (23a and b) from (19a and b) in your transformation 
paper. According to my certainly not authoritative opinion it should be p = e +xy / £ 
and not p = e~ xy / £ , for out of (x + ed/dy)p(x,y) [NBl, Eq. 19a, our Eq. (5)] 
I always get —God be darned — (+TxT^ + e d/dy) ip(x, y) = [NBl, Eq. 23a, 
our Eq. (14)]. Now it's possible that I am doing something nonsensical with 
these constantly conjugated quantities (F, F*, F^: read: F-blurry, F-ill, and F- 
deceased), but I don't understand anything anymore. Since, however, the quantity 
p(x, y) forms the basis of my mathematics, I am kindly asking you for clarification 
of the sign (Heisenberg to Jordan, March 17, 1927, AHQP). 

Jordan's response apparently has not been preserved but from another letter from 
Heisenberg to Jordan a week later, we can infer that Jordan wrote back that the 
expression for p(x, y) in Neue Begriindung I is correct but that there are sign errors in 
Eqs. (23ab). In the meantime Heisenberg had submitted his uncertainty paper and 
replied: "I now fully agree with your calculations and will change my calculations 
accordingly in the proofs" (Heisenberg to Jordan, March 24, 1927, AHQP). 
72 Because of the sign error in Eq. (16) [NBl, Eq. 23b], Jordan set q in the Q-basis 
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With these representations of his quantum-mechanical quantities p and q, 
Jordan could now define their addition and multiplication through the corre- 
sponding addition and multiplication of the differential operators representing 
these quantities. 

Jordan next step was to work out what the differential operators TpT~ l and 
—T(K/i) d/dpT -1 , representing p and q in the Q-basis, are in the special case 
that Q = q. In that case, Eqs. (14) and (16) [NB1, Eqs. 23ab] turn into: 

{ TpT ~ 1 + ^) {q ' lq) = °> (17) 

(T~T- 1 + q ] J(q'\q) = 0. (18) 
On the other hand, (q'\q) = S(q' — q). So (q'\q) trivially satisfies: 

(t^ + m) {q ' lq) = °' [NB1) sec - 2 ' Eq - 24a] (19) 

(-q' + q) (q'\q) = 0. [NB1, sec. 2, Eq. 24b] (20) 



Comparing Eqs. (19)-(20) with Eqs. (17)— (18), we arrive at 

TpT-'Wlq) = ~W\q), (21) 

-T-^T~\q'\q) = q'{q'\q). (22) 
% op 

Eq. (21) suggests that TpT -1 , the momentum p in the g-basis acting on the q' 
variable, is just (h/i) d/dq' . Likewise, Eq. (22) suggests that — T (h/i) d/dpT^ 1 
the position q in the g-basis acting on the q' variable, is just multiplication by 
q' . As Jordan put it in a passage that is hard to follow because of his confusing 
notation: 



Therefore, as a consequence of (24) [our Eqs. (19)-(20)], the operator x 
[multiplying by q' in our notation] is assigned (zugeordnet) to the quantity 
[Grosse] Q itself [q in our notation]. One sees furthermore that the operator 
ed/dx [(h/i) d/dq' in our notatio n] corresponds to the momentum P [p] 
belonging to Q [q] fljordanl . fl927bl . p. 815)0 



It is by this circuitous route that Jordan arrived at the usual functional 
interpretation of coordinate and momentum operators in the Schrodinger 



equal to T((h/i) d/dp)T~ l . 

73 We remind the reader that Jordan used the term 'operator' [Operator] not for 
an operator acting in an abstract Hilbert space but for the differential operators 
(h/i)d/dx and (multiplying by) x and for combinations of them. 
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formalism. iJordanl ( Il927bl . pp. 815-816) emphasized that the association of 
(h/i) d/dq and q withp and q can easily be generalized. Any quantity (Grosse) 
obtained through multiplication and addition of q and p is associated with the 
corresponding combination of differential operators q and (h/i) d/dq. 

Jordan's argument as it stands fails. We cannot conclude that two operations 
are identical from noting that they give the same result when applied to one 
special case, here the delta function (q'\q) = S(q' — q) (cf. Eqs. (21)-(22)). 
We need to show that they give identical results when applied to an arbitrary 
function. We can easily remedy this flaw in Jordan's argument, using only 
the kind of manipulations he himself used at this point (though we will do 
so in modern notation). We contrast this proof in the spirit of Jordan with 
a modern proof showing that Eqs. (14) and (16) imply that p and q, now 
understood in the spirit of von Neumann as operators acting in an abstract 
Hilbert space, are represented by (h/i) d/dq and q, respectively, in the g-basis. 
The input for the proof a la Jordan are his postulates and the identification of 
the differential operators representing momentum and position in the Q-basis 
as TpT~ l and —T(h/i)d/dpT~ l , respectively (cf. our comments following 
Eq. (16)). The input for the proof a la von Neumann are the inner-product 
structu re of Hilbert sp a ce and the spectral decomposition of the operator p. Of 



course, ivon Neumann! ( 1l927a[ ) only introduced these elements after Jordan's 
Neue Begriindung I. 

Closely following Jordan's approach, we can show that Eqs. (14) and (16) 
[NB1, Eqs. 23ab] imply that, for arbitrary functions F(Q), if Q is set equal to 

q, 

(TpT~ l F)(q) = ~F(q), (23) 
i dq 



(24) 



Since F is an arbitrary function, the problem we noted with Eqs. (21)-(22) 
is solved. Jordan's identification of the differential operators representing mo- 
mentum and position in the g-basis does follow from Eqs. (23)-(24). 

To derive Eq. (23), we apply T, defined in Eq. (7), to p(T~ l F)(p). We then 
use the definition of T -1 in Eq. (10) to write (TpT~ 1 F)(Q) as: 



(TpT~ l F)(Q) = J (Q\p) p (T~ l F)(p) dp 



(Q\p) p 



(p\Q')F{Q')dQ' 



dp 



= JJ(Q\p)p( P \Q')F(Q')dpdQ'. (25) 
We now set Q = q, use Eq. (5) to substitute —(h/i) d/dq' (p\q') for p(p\q'), 
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and perform a partial integration: 



(TpT^F^q) = U (q\p) p (p\q')F(q') dpdq' 

= JJ{Q]p) {-^(pW^F(q')dpdq' 

= JJ(q\p)(p\q')j^^dpdq'. (26) 

On account of completeness and orthogonality (see Eq. (3) [NB1, Eqs. 14- 
17]), the right-hand side reduces to (h/i) F'(q). This concludes the proof of 
Eq. (23). 

To derive Eq. (24), we similarly apply T to —(h/i) d/dp (T~ 1 F)(p): 



T-^T-'F] (Q) = -f (Q\ P ) ~ (T~ 1 F)(p) dp 
i op ) J i op 



-f{Q\p)^[f(p\Q')F(Q')dQ' 
h d 



dp 



(Q\p) -jr (p\Q')F(Q')dpdQ'. (27) 
% op 

We now set Q = q and use Eq. (6) to substitute q'(p\q') for —{h/i) d/dp (p\q'): 

(-T—^Fj (q) = JJ(q\p)q'(p\q')F(q')dpdq' = qF(q), (28) 

where in the last step we once again used completeness and orthogonality. 
This concludes the proof of Eq. (24). 

We now turn to the modern proofs. It is trivial to show that the representation 
of the position operator q in the g-basis is simply multiplication by the eigen- 
values q. Consider an arbitrary eigenstate \q) of position with eigenvalue q, 
i.e., q\q) = q\q). It follows that (Q\ q \ q) = q(Q\q), where \Q) is an arbitrary 
eigenvector of an arbitrary Hermitian operator Q = with eigenvalue Q. 
The complex conjugate of this last relation, 

(q\q\Q) = q(q\Q), (29) 
is just the result we wanted prove. 

It takes a little more work to show that Eq. (14) [NB1, Eq. 23a] implies that 
the representation of the momentum operator p in the g-basis is (h/i) d/dq. 
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Consider Eq. (25) for the special case F(Q) = (Q\q): 

TpT-'iQlq) = JJ(Q\p)p (p\Q')(Q'\q) dpdQ'. (30) 

Recognizing the spectral decomposition / dpp \p){p\ of p in this equation, we 
can rewrite it as: 

TpT~ 1 (Q\q) = J(Q\p\Q')(Q'\q)dQ'=(Q\p\q), (31) 

where in the last step we used the resolution of unity, 1 = / dQ' \Q'){Q'\. Eq. 
(14) tells us that 

TpT- l {Q\q) = -~{Q\q). (32) 
% oq 

Setting the complex conjugates of the right-hand sides of these last two equa- 
tions equal to one another, we arrive at: 

(q\P\Q) = ^(q\Q), (33) 

which is the result we wanted to prove. Once again, the operator Q with 
eigenvectors \Q) is arbitrary. If Q is the Hamiltonian and q is a Cartesian 
coordinate, (q\Q) is just a Schrodinger energy eigenf unction. 

With these identifications of p and q in the g-basis we can finally show that 
Jordan's new definition of conjugate variables in Eq. (4) [NB1, Eq. 18] re- 
duces to the standard definition, [p, q] = h/i, at least for quantities with com- 
pletely continuous spectra. Letting [(h/i) d/dq, q] act on an arbitrary function 
f(q), one readily verifies that the result is (h/i) f(q)- Given the association of 
(h/i) d/dq and q with the quantities p and q that has meanwhile been estab- 
lished, it follows that these quantities indeed satisfy the usual commutation 
relation 

\p,q]=pq-qp = -. [NBl,Eq. 25] (34) 

This concludes Part I (consisting of sees. 1-2) of Neue Begriindung I. Jordan 
wrote: 

This is the content of the new theory. The rest of the paper will be devoted, 
through a mathematical discussion of these differential equations [NB1, Eqs. 
23ab, our Eqs. (14)-(16), and similar equations for other quantities], on the 
one hand, to proving that our postulates are mathematically consistent 
[widerspruchsfrei] and, on the other hand, to showing that the earli er forms 



[Darstellungen] of quantum mechanics are contained in our theory (IJordan 
1927bl . p. 816). 



In this paper we focus on the first of these tasks, which amounts to providing 
a realization of the postulates discussed in this secton. 



43 



2.2 Canonical transformations in classical mechanics, the old quantum the- 
ory and matrix mechanics 



Given the central role of canonical transformations in Neue Begriindung, we 
insert a brief subsection to review the use of canonical transformations in 
the developments leading up to it J 74 1 Canonical transformations in classical 



physics are transformations of the position and conjugate momentum variables 
(q,p) that preserve the form of Hamilton's equations, 



dHjp, q) 
dp 



P 



dHjp, q) 
dq 



(35) 



Following iJordanl ( 1l927bl . p. 810) in Neue Begriindung I, we assume that the 
system is one-dimensional. For convenience, we assume that the Hamiltonian 
H(p, q) does not explicitly depend on time. The canonical transformation to 
new coordinates and momenta (Q, P) is given through a generating function, 
which is a function of one of the old and one of the new variables. For a 
generating function of the form F(q, P), for instance] 75 1 we find the equations 



for the canonical transformation (q,p) — > (Q,P) by solving the equations 



P 



dF(q,P) 
dq 



Q 



dF(q,P) 
dP 



(36) 



for Q{q, p ) an d P(q,p). This transformation preserves the form of Hamilton's 
equations 
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Q 



dH(P,Q) 



P 



dH{P,Q) 



(37) 



dP ' " dQ 
where the Hamiltonians H(p, q) and H(P, Q) are numerically equal to one 
another but given by different functions of their respective arguments. One 
way to solve the equations of motion is to find a canonical transformation 
such that, in terms of the new variables, the Hamiltonian depends only on 
momentum, H(P, Q) = H(P). Such variables are called action-angle variables 
and the standard notation for them is (J, w). The basic quantization condition 
of the old quantum theory of Bohr and Sommerfeld restricts the value of a 
set of action variables for the system under consideration to integral multiples 
of Planck's constant, J = nh (n = 0, 1, 2, . . .). Canonical transformations to 
action-angle variables thus played a central role in the old quantum theory. 
With the help of them, the energy spectrum of the system under consideration 
could be found. 



74 This subsection is bas ed onlDuncan and Janssenl ( 20091 . sec. 2). 

75 In the classification of Goldstein et all ( 20021 . p. 373. table 9.1), this corresponds 
to a generating function of type 2, F2(q,P). The other types depend on (q,Q), 
(p,Q), or (p,P). Which type one chooses is purely a matter of convenience and 
does not affect the physical content. 
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For elementary discussion, see, e.g.. lDuncan and Janssenl (|2007l . Pt. 2, sec. 5.1 
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In classical mechanics, canonical transformations preserve the so-called Pois- 
son bracket, {p, q} = 1. For any two phase-space functions G(p,q), H(p,q) of 
the pair of canonical variables (p, q), the Poisson bracket is defined as 

{G(p, q), Hip, q)} = MM MM _ dH(p,g)dG(p,q)_ (3g) 



dp 



dq 



dp 



dq 



For G(p, q) = p and H(p, q) = q, this reduces to {p, q} = 1. We now compute 
the Poisson bracket {P, Q} of a new pair of canonical variables related to (p, q) 
by the generating function F(q,P) as in Eq. (36): 



{P(p,q),Q(p,q)} 



dP(p, q) dQ{p, q) dQ{p, q) dP(p, q) 



dp dq dp 
By the usual chain rules of partial differentiation, we have 



dq 



(39) 



dQ 


d 2 F 


dP 


dp 


dP 2 


dp 

q f q 



dQ 
dq 



d 2 F d 2 F 



dqdP dP 2 



dP 
dq 



(40) 
(41) 



Substituting these two expressions into Eq. (39), we find 

d 2 F 



{P(p,q),Q(p,q)}=^ 



d 2 F 



dqdP dP 2 



dP 
dq 

d 2 F 



dP 2 



dP 
dp 



dP 



d 2 F dP 



dqdP dp 

The final line is identically equal to 1, as 

d 2 F dp 
dqdP = dP 



(42) 



dP 
dp 



This shows that the Poisson bracket {p, q} 
canonical transformations. 
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1 is indeed invariant under 



In matrix mechanics a canonical transformation is a transformation of the 
matrices (q,p) to new matrices (Q,P) preserving the canonical commutation 
relations ^ 

[p, q]=pq-qp= - (44) 
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that replace the Poisson bracket {p, q] = 1 in quantum mechanics. Such trans- 
formations are of the form 



P = TpT-\ Q = TqT' 1 , H = THT~\ (45) 

where H is obtained by substituting TpT~ x for p and TqT' 1 for q in the 
operator H given as a function p and q. One easily recognizes that this trans- 
formation does indeed preserve the form of the commutation relations (44): 
[P, Q] = h/i. Solving the equations of motion in matrix mechanics boils down 
to finding a transformation matrix T such that the new Hamiltonian H is 
diagonal. The diagonal elements, H mm , then give the (discrete) energy spec- 
trum. 



In two papers before Neue Begriindung, iJordanl ( 1l926al lbl) investigated the 
relation between the matrices T implementing canonical transformations in 
matrix mechanics and generating functions in classical mechanics. He showed 
that the matrix T corresponding to a generating function of the forrrl 77 ! 



F(p,Q)=T,fM9n(Q), 



(46) 



is given by 



T(q,p) = exp- I (p,q) -J^(/„(p), g n (q)) 



(47) 



where the notation (. , .) in the exponential signals an ordering such that, when 
the exponential is expanded, all p's are put to the left of all g's in every term 
of the expansion 
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When he wrote Neue Begriindung I in late 1926, Jordan was thus steeped in 
the use of canonical transformations, both in classical and in quantum physi cs. 
When Kuhn asked Jordan about his two papers on the topic (Jordan, 1926al lbl) 
in an interview for the AHQP project, Jordan told him: 



Canonical transformations in the sense of Hamilton- Jacobi were . . . our 
daily bread in the preceding years, so to tie in the new results with those as 
closely as possible — that was something very natural for us to try (AHQP 
interview with Jordan, session 4, p. 11). 



In the classification of Goldstein et al. (cf. note 75), this corresponds to a gener- 
ating function of type 3, F^ ( p, Q). 

78 See Duncan and Janssen (j2009l . pp. 355-356) for a reconstruction of Jordan's 
proof of this result. 



46 



2.3 The realization of Jordan's postulates: probability amplitudes and canon- 
ical transformations 



At the beginning of sec. 4 of Neue Begriindung I, "General comments on the 
differential equations for the amplitudes," Jordan announced: 

To prove that our postulates are mathematically consistent, we want to give 
a new foundation of the theory — independently from the considerations in 
sec. 2 — based on the differential equations which appeared as end results 



there flJordanl . Il927bl . p. 821). 



He began by introducing the canonically conjugate variables a and /3, satisfy- 
ing, by definition, the commutation relation [&,$] = H/i. They are related to 
the basic variables p and q, for which the probability amplitude, according to 
Jordan's postulates, is (p\q) = e~ tpq / h (see Eq. (4)), via 

a = f(p,q)=TpT~\ (48) 

P = g(p,q)=TqT-\ (49) 

with T = T(p, q) [NBl, sec. 4, Eq. 1] 03 Note that the operator T(p, q) defined 
here is different from the operator T . . . = f dp (Q\p) . . . defined in sec. 2 (see 
Eq. (7), Jordan's Eq. (21)). The T(p, q) operator defined in sec. 4 is a similarity 
transformation operator implementing the canonical transformation from the 
pair (p,q) to the pair (&,$). We will see later that there is an important 
relation between the T operators defined in sees. 2 and 4. 

Jordan now posited the fundamental differential equations for the probability 
amplitude (q\/3) in his theory 

{ ; (jJ? 9 ) + 7^} m = °' [NB1, sec ' 4 ' Eq - 2a] (50) 

1 9 q^j ~ /Sj W) = 0. [NBl, sec. 4, Eq. 2b] (51) 

These equations have the exact same form as Eqs. (14)-(16) [NBl, sec. 2, Eqs. 
23ab], with the understanding that the operator T is defined differently. As 
Jordan put it in the passage quoted above, he took the equations that were 
the end result in sec. 2 as his starting point in sec. 4. 



79 The numbering of equations in Neue Begriindung I starts over in sec. 3, the first 
section of Part Two (77. Teil), and then again in sec. 4. Sees. 6 and 7, finally, have 
their own set of equation numbers. 

80 He introduced separate equations for the Erg dnzung Samplitude [NBl, sec. 4, Eqs. 
3ab] (see Eqs. (90)-(91) below). We ignore these additional equations for the mo- 
ment but will examine them for some special cases in sec. 2.4. 
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Before turning to Jordan's discussion of these equations, we show that they 
are easily recovered in the modern Hilbert space formalism. The result of the 
momentum operator a in Eq. (48) acting on eigenvectors \(3) of its conjugate 
operator (3 in Eq. (49) is, as we saw in Section 2.1 
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m = (52) 

Taking the inner product of these expressions with \q) and using that a = 
f(p, q), we find that 

Since p and q are represented by the differential operators (H/i)d/dq and q, 
respectively, in the g-basis, we can rewrite this as 

Combining these last two equations, we arrive at Eq. (50). Likewise, using 
that /3\/3) = (3\/3) and that (3 = g(p, q), we can write the inner product (q\{3\{3) 
as 

(q\m = Pm = g{j^,q ] )(qm, (55) 

where in the last step we used the representation of p and q in the g-basis. 
From this equation we can read off Eq. (51). 

We turn to Jordan's discussion of Eqs. (50)-(51) [NB1, sec. 4, Eqs. 2ab]. As 
he pointed out: 

As is well-known, of course, one cannot in general simultaneously impose 
two partial differential equations on one function of two variables. We will 
prove, however, in sec. 5: the presupposition — which we already made — that 
a and $ are connected to p and q via a canonical transformation (1) [our 
Eqs. (48)-(49)] is the ne cessary and suf ficient condition for (2) [our Eqs. 
(50)-(51)] to be solvable fljordanl . ll927bL p. 822; hats added). 



In sec. 5, "Mathematical theory of the amplitude equations," iJordanl (11927b . 
pp. 824-828) made good on this promise. To prove that the "presupposition" 



81 The complex conjugate of Eq. (33) can be written as 

(Q\p\q) = -^§- q (Q\q) = (Q\ ("7^) l9>- 

Since this holds for arbitrary \Q), it follows that p\q) = —(h/i)(d/dq)\q). This will 
be true for any pair of conjugate variables. 
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is sufficient, he used canonical transformations to explicitly construct a simul- 
taneous solution of the pair of differential equations (50)-(51) for probability 
amplitudes (ibid., pp. 824-825, Eqs. 9-17). He did this in two steps. 



(1) He showed that the sufficient condition for (Q\/3) to be a solution of 
the amplitude equations in the Q-basis, given that (q\/3) is a solution of 
these equations in the g-basis, is that (p, q) and (P, Q) are related by a 
canonical transformation. 

(2) He established a starting point for generating such solutions by show- 
ing that a very simple canonical transformation (basically switching p 
and q) turns the amplitude equations (50)-(51) into a pair of equations 
immediately seen to be satisfied by the amplitude (q\/3) = e iq @' h . 

With these two steps Jordan had shown that the assumption that P and Q 
are related to p and q through a canonical transformation is indeed a sufficient 
condition for the amplitude equations (50)-(51) [NB1, sec. 4, Eqs. 2ab] to be 
simultaneously solvable. We will cover this part of Jordan's argument in detail. 



The proof th at this assumpt ion is necessary as well as sufficient is much more 
complicated (jJordanl . Il927bl . pp. 825-828, Eqs. 18-34). The mathematical pre- 
liminaries presented in sec. 3 of Neue Begriindung I (ibid., pp. 816-821) are 
needed only for this part of the proof in sec. 5. We will cover neither this part 
of sec. 5 nor sec. 3. 



However, we do need to explain an important result that Jordan derived in 
sec. 5 as a consequence of this part of his proof (ibid., p. 828, Eqs. 35-40): 
Canonical transformations T(p, q) as defined above (see Eqs. (48)-(49) [NB1, 
sec. 4, Eq. 1]), which are differential operators once p and q have been replaced 
by their representations (h/i)d/dq and q in the g-basis, can be written as 
integral operators T as defined in sec. 2 (see Eq. (7) [NB1, sec. 2, Eq. 21]). 



This result is central to the basic structure of Jordan's theory and to the logic 
of his Neue Begriindung papers. It shows that Jordan's probability amplitudes 
do double duty as integral kernels of the operators implementing canonical 
transformations. As such, Jordan showed, they satisfy the completeness and 
orthogonality relations required by postulate C (see Eqs. (l)-(3) [NB1, sec. 2, 
Eqs. 14-17]). To paraphrase the characterization of Jordan's project by Hilbert 
et al. that we already quoted in Section 1.1, Jordan postulated certain relations 
between his probability amplitudes in Part One of his paper and then, in 
Part Two, presented "a simple analyti cal apparatus in which quantities occur 
that satisfy these relations exactly" (IHilbert. von Neumann, and Nordheiml . 



19281 . p. 2). These quantities, it turns out, are the integral kernels of canonical 



transformations. Rather than following Jordan's own proof of this key result, 
which turns on properties of canonical transformations, we present a modern 
proof, which turns on properties of Hilbert space and the spectral theorem. 
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But first we show, closely following Jordan's own argument in sec. 5 of Neue 
Begriindung I, how to construct a simultaneous solution of the differential 
equations (50)-(51) [NB1, sec. 4, Eqs. 2ab] for the amplitudes. Suppose we 
can exhibit just one case of a canonical transformation (p, q) — > (a, (3) (Eqs. 
(48)-(49) [NB1, sec. 4, Eq. 1]) where the amplitude e quat ions manifestly have 



a unique simultaneous solution. According to Jordanjj^J any other canonical 
pair can be arrived at from the pair (p, q) via a new transformation function 
S(P, Q), in the usual way 

p = SPS-\ q = SQS-\ (56) 

with S = S(P,Q) [NB1, sec. 5, Eq. 10]. The connection between the original 
pair (d, (3) and the new pair (P, Q) involves the composite of two canonical 
transformations [NB1, sec. 5, Eq. 11]: 

« = f(p, ?) = fiSPS' 1 , SQS~ l ) = F(P, Q), (57) 

h = 9(q, q) = g(SPS-\ SQS- 1 ) = G(P, Q). (58) 

In the new Q-basis, the differential equations (50)-(51) [NB1, sec. 4, Eqs. 2ab] 
for probability amplitudes take the form 

{ F {^W Q ) + 7^} {Qll3) = °' [NB1 ' sec ' 5 ' Eq ' 12a] (59) 

/ " T^ =0 ' [NBl, sec. 5, Eq. 12b] (60) 

Jordan now showed that 

(QW) = (7^.?) [NBl, sec. 5, Eq. 13] (61) 

is a simultaneous solution of the amplitude equations (59)-(60) in the Q-basis 
if (q\fi) is a simultaneous solution of the amplitude equations (50)-(51) in the 
g-basis. Using the operator S and its inverse S^ 1 , we can rewrite the latter 
aS 

S { f (i^' 9 ) + T^} S ~ 1S{ql ^ = °' [NB1 ' S6C - 5 ' Eq - Ma] (62) 
S |^ (j^, q) - A S^S (q\P) = 0, [NBl, sec. 5, Eq. 14b] (63) 



82 We will see below that this assumption is problematic. 

83 This step is formally the same as the one that got us from Eqs. (5)-(6) [NBl, sec. 
2, Eqs. 19ab] to Eqs. (14)-(16) [NBl, sec. 2, Eqs. 23ab]. 
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both taken, as in Eq. (61), at q = Q. Written more carefully, the first term in 
curly brackets in Eq. (62), sandwiched between S and S 1-1 , is 



-'(?4-)^-WS-)'(?t«)^K«)L- ™ 

With the help of Eq. (57), this can further be rewritten as 

S(P, Q) f(P, Q) S(P, Qr'lp^o =f{-^q\ (65) 

The second term in curly brackets in Eq. (62), sandwiched between S and 
S -1 , is simply equal to 

s ^ s (66) 

as S does not involve f3. Using Eqs. (61) and (64)-(66), we can rewrite Eq. 
(62) [NB1, sec. 5, Eq. 14a] as 

which is just Eq. (59) [NB1, sec. 5, Eq. 12a]. A completely analogous argument 
establishes that Eq. (63) [NB1, sec. 5, Eq. 14b] reduces to Eq. (60) [NB1, sec. 
5, Eq. 12b]. This concludes the proof that (Q\(3) is a solution of the amplitude 
equations in the new Q-basis, if (q\f3), out of which (Q\(3) was constructed 
with the help of the operator S implementing a canonical transformation, is 
a solution of the amplitude equations in the old g-basis. 

As S is completely general, we need only exhibit a single valid starting point, 
i.e., a pair (/, g) and an amplitude (q\/3) satisfying the amplitude equations 
in the g-basis (Eqs. (50)-(51) [NB1, sec. 4, Eqs. 2ab]), to construct general 
solutions of the amplitude equations in some new Q-basis (Eqs. (59)-(60) 
[NB1, sec. 5, Eqs. 12ab]). The trivial example of a canonical transformation 
switching the roles of coordinate and momentum does the trick (cf. Eqs. (57)- 
(58) [NB1, sec. 5, Eq. 11]): 

a = f(P, q) = -q, P = g(p, q) = V- [ NB1 ; sec- 5, Eq. 15] (68) 
In that case, Eqs. (50)-(51) become [NB1, sec. 5, Eq. 16] 

f^-A m = . (70) 
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Except for the minus signs, these equations are of the same form as the trivial 
equations (5)-(6) [NBl, sec. 2, Eqs. 19ab] for (p\q), satisfied by the basic 
amplitude (p\q) = e~ ipq ' h . In the case of Eqs. (69)-(70), the solution is: 

(q\P) = e iMh . [NBl, sec. 5, Eq. 17] (71) 

This establishes that the canonical nature of the transformation to the new 
variables is a sufficient condition for the consistency (i.e. simultaneous solv- 
ability) of the pair of differential equations (59)-(60) [NBl, sec. 5, Eq. 12ab] 
for the probability amplitudes. 



Jordan! (Il927bl . pp. 825-828) went on to prove the converse, i.e., that the 
canonical connection is also a necessary condition for the consistency of Eqs. 
(59)-(60). This is done, as Jordan explained at the top of p. 827 of his paper, 
by explicit construction of the operator S (in Eq. (61)), given the validity of 
Eqs. (59)-(60). We skip this part of the proof. 



Jordan fll927bl . p. 828) then used some of the same techniques to prove a key 



result in his theory. As mentioned above, we will appeal to the modern Hilbert 
space formalism and the spectral theorem to obtain this result. Once again 
consider Eq. (61): 

{Q\P) = {s{~,q\W)\ ■ [NBl, sec. 5, Eq. 13] (72) 



% dq ' 



<i= 



This equation tells us that the differential operator S((h/i) d/dq, q) maps ar- 
bitrary states (q\(3) in the g-basis (recall that (3 can be any operator) onto the 
corresponding states in the Q-basis. The spectral theorem, which gives 

us the resolution / dq\q)(q\ of the unit operator, tells us that this mapping can 
also be written as 

(Q\(3}= fdq(Q\q)(q\0). (73) 



Schematically, we can write 




jdq{Q\q)... (74) 



In other words, the probability amplitude (Q\q) is the integral kernel for the in- 
tegral representation of the canonical transformation operator S((h/i) d/dq, q). 
Using nothing but the properties of canonical transformations and his differ- 
ential equa^nsjorprobability amplitudes (Eqs. (50)-(51) [NBl, sec. 4, Eqs. 



2 ab]). I Jordan! ( 1l927bl . p. 828) derived an equation of exactly the same form as 



Eq. (74), which we give here in its original notation: 

= J dx.(p{y,x). . . . [NBl, sec. 5, Eq. 40] (75) 
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Jordan claimed that Eqs. (50)-(51) [NB1, sec. 4, Eqs. 2ab] contain both the 
time-independent and the time-dependent Schrodinger equations as special 
cases] 84 1 The time-independent Schrodinger equation is a special case of Eq. 
(51): 

If in (2b) we take f3 to be the energy W, and g to be the Hamiltonian 
function H(p, q) of the system, we obtain the Schrodinger wave equation, 
which corresponds to the classical Hamilton- Jacobi equation. With (2b) 
comes (2a) as a second equation. In this equation we ne ed to consider / to 
be the time t (as a function of p and q) (j Jordan! . Il927bl . p. 822). 



Actually, the variable conjugate to H would have to be minus t. For a = 
fiPiQ) = ~t an d $ = giftA) — H (with eigenvalues E), Eqs. (50)-(51) be- 
come: 

H^}<^=°- < 76) 

{H - E} (q\E) = 0. (77) 

If (q\E) is set equal to i/)e(q), Eq. (77) is indeed just the time-independent 
Schrodinger equation. 

Jordan likewise claimed that the time-dependent Schrodinger equation is a 
special case of Eq. (50) 

if for (3 we choose the time t [this, once again, should be —t], for g [minus] the 
time t(p,q) as fu nction o f p, q, a nd, correspondingly, for / the Hamiltonian 
function H(p,q) fljordanl . Il927bl p. 823). 



This claim is more problematic. For a = f(p,q) = H (eigenvalues E) and 
h = gipA) = ~t, Eqs. (50)-(51) [NB1, sec. 4, Eqs. 2ab] become: 



0. 



(7* 



{i-t}(q\t}=0. (79) 

If (q\t) is set equal to ip(q,t), Eq. (78) turns into the time-dependent Schro- 
dinger equation. However, time is a parameter in quantum mechanics and not 
an operator t with eigenvalues t and eigenstates \t). 



S5 



This also makes Eqs. (76) and (79) problematic. Consider the former. For a 
free particle, the Hamiltonian is H = p 2 /2m, represented by ((h/i)d/dq) 2 /2m 



Cf. iHilbert. von Neumann, and Nordheiml (jl928l . se c. 10, pp. 27 -29, "The 
Schrodinger differential equations"), discussed briefly by Jammer ( 19661 . p. 311). 
85 There is an extens i ve lite rature on this subject. For an introduction to this issue, 
;.g., iHilgevoord (120021 ) and the references therein. 



sec, c. 
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in the g-basis. The solution of Eq. (77), 

(q\E) = e^v^g/^ ( 80 ) 

is also a solution of Eq. (76) as long as we define t = mqp' 1 , as suggested by 
the relation q = (p/m)t, Note, however, that we rather arbitrarily decided on 
this particular ordering of the non-commuting operators p and q. Using that 

h d 




e iV2^Eg/h = ^2^E e iV ^ q/r \ (81) 
i oq 

we find that t (q\E) = mqp^ 1 (q\E) is given by: 

l 

i\/2mEq/h _ m( l e iV2mEq/h rg2) 

" V2^E 

This is indeed equal to (h/i)d/dE (q\E) as required by Eq. (76): 

iV2mEq/h _ m( l c iV2~mEq/h /gg\ 

i BE ~ y^E ' 1 ' 

So with t = mqp^ 1 , both Eq. (50) and Eq. (51) [NB1, sec. 4, Eqs. 2ab] hold in 
the special case of a free particle. It is not at all clear, however, whether this 
will be true in general. 

It is probably no coincidence that we can get Jordan's formalism to work, 
albeit with difficulty, for a free particle where the energy spectrum is fully 
continuous. Recall that, in Neue Begriindung I, Jordan restricted himself to 
quantities with completely continuous spectra. As he discovered when he tried 
to generalize his formalism to quantities with partly or wholly discrete spectra 
in Neue Begriindung II, this restriction is not nearly as innocuous as he made 
it sound in Neue Begriindung I. 

Consider the canonical transformation at = TpT" 1 (Eq. (48) [NB1, sec. 4, 
Eq. 1]) that plays a key role in Jordan's construction of the model realizing 
his postulates. Consider (in modern terms) an arbitrary eigenstate \p) of the 
operator p with eigenvalue p, i.e., p\p) = p\p). It only takes one line to show 
that then T\p) is an eigenstate of a with the same eigenvalue p: 

&T\p) = TpT^Tlp) = Tp\p) = pT\p). (84) 

In other words, the operators a and p connected by the canonical transforma- 
tion a = TpT~ x have the same spectrum. This simple observation, more than 
anything else, reveals the limitations of Jordan's formalism. It is true, as Eq. 
(77) demonstrates, that his differential equations Eqs. (50)— (51) [NB1, sec. 4, 
Eqs. 2ab] for probability amplitudes contain the time-independent Schrodinger 
equation as a special case. However, since the energy spectrum is bounded 
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from below and, in many interesting cases, at least partially discrete, it is 
impossible to arrive at the time-independent Schrodinger equation starting 
from the trivial equations (69)-(70) [NBl, sec. 5, Eq. 16] for the probability 
amplitude e l9/3///l between q and (3 — recall that (3 = p in this case (see Eq. 
(68)) — and performing some canonical transformation. As Eq. (84) shows, a 
canonical transformation cannot get us from p's and g's with completely con- 
tinuous spectra to d's and /3's with partly discrete spectra. This, in turn, 
means that, in many interesting cases (i.e., for Hamiltonians with at least 
partly discrete spectra), the time-independent Schrodinger equation does not 
follow from Jordan's postulates. In Jordan's defense one could note at this 
point that this criticism is unfair as he explicitly restricted himself to quanti- 
ties with fully continuous spectra in Neue Begriindung I. However, as we will 
see when we turn to Neue Begriindung II in Section 4, Jordan had to accept 
in this second paper that the extension of his general formalism to quantities 
with wholly or partly discrete spectra only served to drive home the problem 
and did nothing to alleviate it. 



2.4 The confusing matter of the Erganzungsamplitude 



In this subsection, we examine the "supplementary amplitude" (Erganzungs- 
amplitude) ip(x,y) that Jordan introduced in Neue Begriindung I in addition 



to the probability amplitude <p(x,y)\^] Jordan's (1927b, p. 813) postulate I 



sets the conditional probability Pr(x\y) that x has a value between x and 
x + dx given that y has the value y equal to: 

tp(x,y)ip*(x,y)dx. [NBl, sec. 2, Eq. 10] (85) 

Jordan allowed the eigenvalues x and y to be complex. He stipulated that 
the "star" in ijj*(x,y) is to be interpreted in such a way that, when taking 
the complex conjugate of ip(x,y), one should retain x and y and not replace 
them, as the "star" would naturally suggest, by their complex conjugates. The 
rationale for this peculiar rule will become clear below. 

For general complex amplitudes, Eq. (85) only makes sense as a positive real 
probability if the phases of ip(x,y) and ip*(x,y) exactly compensate, leaving 
only their positive absolute magnitudes (times the interval dx, as we are deal- 
ing with continuous quantities). Jordan certainly realized that in cases where 
the mechanical quantities considered were represented by self-adjoint oper- 



ators, this duplication was unnecessary] I He seems to have felt the need 



86 This subsection falls somewhat outside the main line of argument of our paper 
and can be skipped by the reader without loss of continuity. 

87 For instance, if x is a Cartesian coordinate and y is the Hamiltonian, the ampli- 
tudes ip(x, y) = tp(x, y) are just the Schrodinger energy eigenfunctions of the system 



55 



however, to advance a more general formalism, capable of dealing with the 
not uncommon circumstance that a canonical transformation of perfectly real 
(read "self-adjoint" in the quantum-mechanical case) mechanical quantities 
actually leads to a new canonically conjugate, but complex (read "non-self- 
adjoint") pair of quantities. An early example of this can be found in London's 
(1926b) solution of the quantum harmonic oscillator by canonical transforma- 
tion from the initial (q, p) coordinate-momentum pair to raising and lowering 



operators, which are obviously not self-adjoint (IDuncan and Janssenl . 12009k 
sec. 6.2, pp. 357-358). 

Jordan could hardly have been aware at this stage of the complete absence 
of "nice" spectral properties in the general case of a non-self-adjoint opera- 
tor, with the exception of a very special subclass to be discussed shortly. In 
contrast to the self-adjoint case, such operators may lack a complete set of 
eigenfunctions spanning the Hilbert space, or there may be an overabundance 
of eigenfunctions which form an "over-complete" set, in the sense that proper 
subsets of eigenfunctions may suffice to construct an arbitrary state. To the 
extent that eigenfunctions exist, the associated eigenvalues are in general com- 
plex, occupying some domain — of possibly very complicated structure — in the 
complex plane. In the case of the lowering operator in the simple h armonic 



oscillat or, the spectrum occupies the entire complex plane! Instead, [Jordan 



(Il927bl . p. 812) seems to have thought of the eigenvalue spectrum as lying on 



a curve even in the general case of arbitrary non-self-adjoint quantities. 

There is one subclass of non-self-adjoint operators for which Jordan's attempt 
to deal with complex mechanical quantities can be given at least a limited 
validity. The spectral theorem usually associated with self-adjoint and uni- 
tary operators (existence and completeness of eigenfunctions) actually extends 
with full force to the larger class of normal operators N, defined as satisfying 
the commutation relation [N, iV"'] = 0, which obviously holds for self-adjoint 



(N = N<) and unitary (iV' = N" 1 ) operators]. 8 ^ The reason that the spectral 
theorem holds for such operators is very simple: given a normal operator N, 
we may easily construct a pair of commuting self-adjoint operators: 

A^^N + N^), B = ±(N-fr). (86) 

It follows that A = A\B = B\ and that [A, B) = (i/2) [N, = 0. A well- 
known theorem assures us that a complete set of simultaneous eigenstates |A) 
of A and B exist, where the parameter A is chosen to label uniquely the state 



in coordinate space. 
88 For discussion of 



Dennerv and Krzvwicki 



rigorous discussion, see 



the s pecial case of finite Hermitian matrices, see 
(199 6|, sec. 24. 3 , pp. 177-178). For a more general and more 



von Neumann! ([19321 . Ch. II, sec. 10). 
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(we ignore the possibility of degeneracies here), with 



A\\) = a(\)\\), B|A)=/3(A)|A), N |A) = ("(A) |A), (87) 

where £(A) = a(A) + if3(\) are the eigenvalues of N = A + i-B. Of course, 
there is no guarantee that a(X) and /3(A) are continuously connected (once we 
eliminate the parameter A), so the spectrum of N (the set of points £(A) in the 
complex plane) may have a very complicated structure. For a normal operator, 
there at least exists the possibility though that the spectrum indeed lies on a 
simple curve, as assumed by Jordan. In fact, it is quite easy to construct an 
example along these lines, and to show that Jordan's two amplitudes, (p(x, y) 
and ip(x,y), do exactly the right job in producing the correct probability 
density in the (self-adjoint) x variable for a given complex value of the quantity 
y, in this case associated with a normal operator with a complex spectrum. 



Our example is a simple generalization of one that iJordanl (Il927bl . sec. 5, pp. 
830-831) himself gave (in the self-adjoint case). For linear canonical trans- 
formations, the differential equations specifying the amplitudes <p(x,y) and 
ip(x,y) [NB1, sec. 4, Eqs. 2ab and 3ab] are readily solved analytically. Thus, 
suppose that the canonical transformation from a self-adjoint conjugate pair 
(p, q) to a new conjugate pair (a, (3) is given by 

a = ap + bq, f3 = cp + dq, (88) 



where the coefficients a, 6, c, d must satisfy ad — be = 1 for the transformation 
to be canonical, but may otherwise be complex numbers [cf. NB1, sec. 5, Eqs. 
56-57]. The requirement that a be a normal operator (i.e., [a, cr] = 0) is 
easily seen to imply a/a* = b/b*. Thus, a and b have the same complex phase 
(which we may call e 1 ®). Likewise, normality of (3 implies that c and d have 
equal phase (say, e* x ). Moreover, the canonical condition ad — be = 1 implies 
that the phases e tip and e %x must cancel, so we henceforth set \ = — i?, and 
rewrite the basic canonical transformation as 

a = ((ap + bq), P = C(cp + dq), (89) 

where £ = e 1 ^, and a,b,c,d are now real and satisfy ad — be = 1. We see 
that the spectrum of lies along the straight line in the complex plane with 
phase — d (as the operator cp + dq is self-adjoint and therefore has purely real 
eigenvalues): the allowed values for /3 are such that (f3 is real. 



It is now a simple matter to solve the differential equations for the amplitude 
cp(q, (3) and t he supplementa ry amplitude ipiq, 0) in this case. The general 
equations are (IJordanl . Il927bl . sec. 4, p. 821)1 89 1 



Jordan ( 1927bl . p. 817) introduced the notation for the adjoint of F in sec. 3 



of his paper. 
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[NBl, 


sec. 


4, Eq. 2a] 


(90) 


[NBl, 


sec. 


4, Eq. 2b] 


(91) 


[NBl, 


sec. 


4, Eq. 3a] 


(92) 


[NBl, 


sec. 


4, Eq. 3b] 


(93) 



The differential operators in this case are (recall that a = f(p, q) and (3 
g(p,q) (see Eqs. (48)-(49) [NBl, sec. 4, Eq. 1]): 



\ i oq I \ i oq I 



9 = C \ c--z- + dq \ , g J = ([c-— + dq\ , 
\ % oq J y i oq J 

and for <p(q,/3) and ip(q,(3) we find (up to an overall constant factor): 



,(^)-xp{-l(^-I^ + ^(C^ 



HQ, P) =exp {— (^q 2 - k*f3q + ^(CP? 



(94) 



(95) 



We note that the basic amplitude <p(q, (3) is a pure oscillatory exponential, as 
the combinations ((3 and the constants a, c, and d appearing in the exponent 
are all real, so the exponent is overall purely imaginary, and the amplitude has 
unit absolute magnitude. This is not the case for ifj(q, (3), due to the appearance 
of (*, but at this point we recall that, according to Jordan's postulate A, 
the correct probability density is obtained by multiplying ip(x,y) by i/)*(x,y), 
where the star symbol includes the instruction that the eigenvalue y of y is 
not to be conjugated (cf. our comment following Eq. (85)). This rather strange 
prescription is essential if we are to maintain consistency of the orthogonality 
property 

J <p(x, y")ip*(x, y')dx = 6 yly » (96) 

with the differential equations (91) and (93) for the amplitudes [NBl, sec. 4, 
Eqs. 2b and 3b]. With this proviso, we find (recalling again that £/3 is real): 

i;*(q, (3) = exp jl fe 2 - k(3q + |-(C/3) 2 ) } = <p{q, (3), (97) 
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where the bar now denotes conventional complex conjugation, and we see that 
the product <p(q, j3)tp*(q, ($) = ip(q, /3)<p(q, (3) is indeed real, and in fact, equal 
to unity, as we might expect in the case of a purely oscillatory wave function. 

That Jordan's prescription for the construction of conditional probabilities 
cannot generally be valid in the presence of classical complex (or quantum- 
mechanically non-self-adjoint) quantities is easily verified by relaxing the con- 
dition of normal operators in the preceding example. In particular, we consider 
the example of the raising and lowering operators for the simple harmonic os- 
cillator, obtained a gain by a complex linear canonical transformation of the 



(q,p) canonical pairjj^j Now, as a special case of Eq. (88), we take 



« = -j=(p + iq) = f(p,q), P = -^(ip + <i) = g(p,q), (98) 

which, though canonical, clearly does not correspond to normal operators, as 
the coefficients a, b (and likewise c, d) are now 90 degrees out of phase. Solving 
the differential equations for the amplitudes, Eqs. (90)-(93) [NB1, sec. 4, Eqs. 
2ab, 3ab], we now find (up to an overall constant factor [cf. Eq. (95)]): 



<p(q, 13) =exp {--^ (q 2 - 2^q + /3 2 )}, 
1>(q, = exp | -L (g 2 - 2V2$q + (3 2 ) } = ^ (q, f3) = -3 



(99) 



<p(q,PY 



where (3 is an arbitrary complex number. In fact, the wave function ip(q, (3) is 
a square-integrable eigenfunction of (3 for an arbitrary complex value of (3: it 
corresponds to the well-known "coherent eigenstates" of the harmonic oscil- 
lator, with the envelope (absolute magnitude) of the wave function executing 
simple harmonic motion about the center of the potential well with frequency 
cu (given the Hamiltonian H = |(p 2 + u 2 q 2 )). The probability density in q 
of such a state for fixed (3 is surely given by the conventional prescription 
\(p(q,(3)\ 2 . On the other hand, for complex (3, the Jordan prescription requires 
us to form the combination (with the peculiar interpretation of the "star" in 
ijj*(q,(3), in which (3 is not conjugated): 



(100) 



which clearly makes no physical sense in this the state in question is a 

localized, square-integrable one. If Jordan's notion of an Erganzungsamplitude 
is to have any nontrivial content, it would seem to require, at the very least, 



90 As mentioned above, London ( 1926b) ha d looked at this example of a canonical 
transformation (jDuncan and Janssenl . I2009I . sec. 6.2, p. 358). 
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that the complex quantities considered fall into the very special category of 
normal operators after quantization. 



In fact , as we will see shortly, in the paper by lHilbert. von Neumann, and Nordheim 
( 119281 ) on Jordan's version of statistical transformation theory, the require- 
ment of self-adjointness already acquires the status of a sine qua non for 
physical observables in quantum theory, and the concept of an Erganzungs- 
amplitude disappears even from Jordan's own treatment of his theory after 
Neue Begriindung I. 



3 Hilbert, von Neumann, and Nordheim's Grundlagen (April 1927) 



In the winter semester of 1926/27, Hilbert gave a course entitled "Mathemat- 
ical methods of quantum theory." The course consisted of two parts. The first 
part, "The older quantum theory," was essentially a repeat of the course that 
Hilbert had given under the same title in 1922/23. The second part, "The new 
quantum theory," covered the developments since 1925. As he had in 1922/23, 
Nordhe im prepared the notes f or this course, which have recently been pub- 
lished ( Sauer and Majer . 20091 . pp. 504-707; the second part takes up pp. 
609-707). At the very end (ibid., pp. 700-706), we find a concise exposition 
of the main line of reasoning of Jordan's Neue Begriindung I. 



This 



resentation served as the basis for a paper by lHilbert. von Neumann, and Nordheim 
(jl928l ). As the authors explained in the introduction (ibid., pp. 1-2), "impor- 



tant parts of the mathematical elaboration" w ere due to von Neumann, wh ile 
Nordheim was responsible for the final text ( Duncan and Janssenl . 2009 . p. 
361). The paper was submitted to the Mathematische Annalen April 6, 1927, 
but, for reasons not clear to us, was only published at the beginnin g of the vol- 
ume for 1928. It thus appeared after the trilogy by von Neumann ( 1927a b| 3) 
that rendered much of it obsolete. In this section we go over the main points 
of this three-man paper 



91 



In the lecture notes for Hilbert 's course, Dirac is not mentioned at all, and even 



though in the paper it is acknowledged that iDirad (119271 ) had independently 
arrived at and published similar results, the focus continues to be on Jordan. 
There are only a handful of references to Dirac, most importantly in connection 

with the delta function and in the discussion of the Schrodinger equation for a 

Hamiltonian with a partly discrete spectrum ( jHilbert. von Neumann, and Nordheiml . 



sec 



91 For o t her discussions of iHilbert. von Neumann, and Nordheim ( 192g ) 

Jammerl dl966l pp. 3 09-312: cf. note 30 k llVlehra and Rechenberel (|2000-200ll . pp. 
404-411), and Lacki ( 2000l . pp. 295-300, focusing mainly on the paper's axiomatic 
structure). 
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19281 . p. 8 and p. 30, respectively). Both the lecture notes and the paper stay 
close to the relevant sections of Neue Begriindung I, but Hilbert and his col- 
laborators did change Jordan's notation. Their notation is undoubtedly an 
improvement over his — not a high bar to clear — but the modern reader trying 
to follow the argument in these texts may still want to translate it into the 
kind of modern notation we introduced in Section 2. We will adopt the nota- 
tion of Hilbert and his co-authors in this section, except that we will continue 
to use hats to distinguish (operators representing) mechanical quantities from 
their numerical values. 

As we mentioned in S ection 2.1, when we discussed postulates A th rough D 



of Neue Begriindung I, iHilbert. von Neumann, and Nordheiml (119281 p p. 4-5) 



based their exposition of Jordan's theory on six "physical axioms."! 92 1 Axiom 
I introduces the basic idea of a probability amplitude. The amplitude for the 
probability that a mechanical quantity Fi(ftq) (some function of momentum 
p and coordinate q) has the value x given that another such quantity F 2 (pq) 
has the value y is written as <p(xy; F\ F 2 ). 

Jordan's Erqdnzu nqsamplitude still made a brief appearance in the notes for 



Hilbert's course fjSauer and Majerl . |2009| . p. 700) but is silently dropped in 



the paper. As we saw in Section 2, amplitude and supplementary amplitude 
are identical as long as we only consider quantities represented, in modern 
terms, by Hermitian operators. In that case, the probability w(xy; FiF 2 ) of 
finding the value x for F\ given the value y for F2 is given by the product 
of <p(xy; F\ F 2 ) and its complex conjugate, which, of course, will always be a 
real number. Although they did not e xplicitly point out that this eliminates 



the ne ed for the Erganzungsamplitude, IHilbert. von Neumann, and Nordheim 



(119281 . p. 17-25) put great emphasis on the restriction to Hermitian operators. 
Sees. 6-8 of their paper ( "The reality conditions," "Properties of Hermitian 
operators," and "The physical meaning of the reality conditions") are devoted 
to this issue. 

Axiom II corresponds to Jordan's postulate B and says that the amplitude 
for finding a value for F 2 given the value of F\ is the complex conjugate of 
the amplitude of finding a value for F\ given the value of F 2 . This symmetry 
property entails that these two outcomes have the same probability. Axiom 
III is not among Jordan's postulates. It basically states the obvious demand 
that when Fi = F 2 , the probability w(xy; F\ F 2 ) be either (if x ^ y) or 1 (if 
x = y). Axiom IV corresponds to Jordan's postulate C and states that the am- 
plitudes rather than the probabilities themselves follow the usual composition 



92 In the lecture notes we find four axiom s that are essentially the same as Jordan's 
four postulates ( Sauer and Majer . 20091 . pp. 700-701). 
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rules for probabilities (cf. Eqs. (1) and (3) in Section 2.1): 

<p{xz\F 1 F*) = J V (xy;F 1 F 2 ) V (yz;F 2 F 3 )dy. (101) 
Though they did not use Jordan's phrase "interference of probabilities," the 



authors emphasized the central importance of this particular axiom 



This requirement [Eq. (101)] is obviously analogous to the addition and mul- 
tiplication theorems of ordinary probability calculus, except that in this case 
they hold for the amplitudes rather than for the probabilities themselves. 

The characteristic difference to ordinary probability calculus lies herein 
that initially, instead of the probabilities themselves, amplitudes occur, 
which in general will be complex quantities and only give ordinary probabili- 



ties if their absolute value is taken and then squared (IHilbert. von Neumann, and Nordheim 



19281 . p. 5) 



Axiom V, as we already mentioned in Section 2.1, makes part of Jordan's 
postulate A into a separate axiom. It demands that probability amplitudes 
for quantities F\ and F 2 depend only on the functional dependence of these 
quantities on q and p and not on "special properties of the system under 
consideration, such as, for example, its Hamiltonian" (ibid., p. 5). Axiom VI, 
finally, adds another obvious requirement to the ones recognized by Jordan: 
that probabilities be independent of the choice of coordinate systems. 



Before they introduced the axioms. IHilbert. von Neumann, and Nordheiml (11928 



p. 2) had already explained, in a passage that we quoted in Section 1.1, that 
the task at hand was to find "a simple analytical apparatus in which quan- 
tities occur that satisfy" axioms I- VI. As we know from Neue Begriindung 
I, the quantities that fit the bill are the integral kernels of certain canoni- 
cal transformations, implemented as TpT -1 and TqT" 1 (cf. Eqs. (48)-(49)). 
After introducing this "simple analytical apparatus" in sees. 3-4 ( "Basic for- 
mulae of the operator calculus," "Canonical operators and transformations"), 
the authors concluded in sec. 5 ("The physical interpretation of the operator 
calculus" ) : 

The probability amplitude (p(xy;qF) between the coordinate q and an ar- 
bitrary mechanical quantity F{qp) — i.e., for the situation that for a given 
value y of F, the coordinate lies between x and x + dx — is given by the kernel 
of the integral operator that canonically transforms the operator q into the 



opera tor corresponding to the mechanical quantity F(qp) (IHilbert. von Neumann, and Nordheiml . 



19281 . p. 14; emphasis in the original, hats added). 



They immediately generalized this definition to cover the probability ampli- 
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Recall, however, Heisenberg ' s critic ism of this aspect of Jordan's work in the 



uncertainty paper (jHeisenbergl . Il927bl . pp. 183-184; cf. note 38). 
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tude between two arbitrary quantities F 1 and F 2 . In sec. 3, the authors already 
derived differential equations for integral kernels <p(xy) (ibid., pp. 10-11, Eqs. 
(19ab) and (21ab)). Given the identification of these integral kernels with 
probability amplitudes in sec. 5, these equation are just Jordan's fundamental 
differential equations for the latter (NB1, sec. 4, Eqs. (2ab); our Eqs. (50)-(51) 
in Section 2.3). 

In sec. 4, they also stated the key assumption that any quantity of inter- 
est can be obtained through a canonical transformation starting from some 
canonically conjugate pair of quantities p and q: 

We will assume that every operator F can be generated out of the basic 
operator q by a canonical transformation. This statement can also be ex- 
pressed in the following way, namely that, given F, the operator equation 
TqT~ l has to be solvable. 

The conditions that F has to satisfy for this to be possible will not be 



investigated here (IHilbert. von Neumann, and Nordheiml . Il928l . p. 12; hats 
added) . 

What this passage suggests is that the authors, although they recognized the 
importance of this assumption, did not quite appreciate that, as we showed 
at the end of Section 2.3, it puts severe limits on the applicability of Jordan's 
formalism. In the simple examples of canonical transformations (F = f(q) and 
F = p) that they considered in sec. 9 ("Application of the theory to special 
cases"), the assumption is obviously satisfied and the formalism works just 
fine (ibid., pp. 25-26). In sec. 10 ("The Schrodinger differential equations"), 
however, they set F equal to the Hamiltonian H and claimed that one of the 
differential equations for the probability amplitude ip(xW; q H) (where W is 
an energy eigenvalue) is the time-independent Schrodinger equation. As soon 
as the Hamiltonian has a wholly or partly discrete spectrum, however, there 
simply is no operator T such that H = TqT^ 1 . 



In see s. 6-8, which we already briefly mentioned above. IHilbert. von Neumann, and Nordheim 



(119281 . pp. 17-25) showed that the necessary and sufficient condition for the 
probability w(xy; Fi F 2 ) to be real is that Fi and F 2 are both represented by 
Hermitian operators. As we pointed out earlier, they implicitly rejected Jor- 
dan's attempt to accommodate F's represented by non-Hermitian operators 
through the introduction of the Erganzungsamplitude. They also showed that 
the operator representing the canonical conjugate G of a quantity F repre- 
sented by a Hermitian operator is itself Hermitian. 

The authors ended their paper on a cautionary note emphasizing its lack 
of mathematical rigor. They referred to von Neumann's (1927a) forthcoming 
paper, Mathematische Begriindung, for a more satisfactory treatment of the 
Schrodinger equation for Hamiltonians with a partly discrete spectrum. In the 
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concluding paragraph, they warned the reader more generally: 



In our presentation the general theory receives such a perspicuous and for- 
mally simple form that we have carried it through in a mathematically still 
imperfect form, especially since a fully ri gorous presentation might well be 
consi derably more tedious and circuitous (jHilbert. von Neumann, and Nordheim . 



19281 . p. 30). 



4 Jordan's Neue Begriindung II (June 1927) 



In April and May 1927, while at Boh r's insti t ute in Copenhagen on an Inter- 



national Education Board fellowship, iJordanl (jl927gl ) wrote Neue Begriindung 



II, which was received by Zeitschrift fur Physik June 3, 1927|_^j In the abstract 



he announced a "simplified and generalized" version of the theory presented 
in Neue Begriindung I 
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One si mplification was that Jordan, like lHilbert. von Neumann, and Nordheim 



(119281 ). dropped the Ergdnzungsamplitude and restricted himself accordingly 
to physical quantities represented by Hermitian operators and to canonical 
transformations preserving Hermiticity. Another simplification was that he 
adopted Dirac's (1927) convention of consistently using the same letter for a 
mechanical quantity and its possible values, using primes to distinguish the 
latter from the former. When, for instance, the letter f3 is used for some quan- 
tity, its values are denoted as /?', (3", etc. We will continue to use the notation 
(3 for the quantity (and the operator representing that quantity) and the no- 
tation (3, (3', . . . for its values. While this new notation for quantities and their 
values was undoubtedly an improvement, the new notation for probability 



94 Not long after leaving for Copenhagen, Jordan wrote a long letter to Dirac, who 
was still in Gottingen, touching on some of the issues addressed in Neue Begriindung 
II. Jordan wrote: "Following our conversation a few days ago I want to write you 
a little more. It's about a few things connected to those we discussed. Most of 
these considerations originate in the fall of last year. Back then, however, I didn't 
succeed in achieving a complete clarification of the question that interested me, and 
later I haven't really had the courage to take up the issue again. It would certainly 
please me if you could make some progress on these questions" (Jordan to Dirac, 
April 14, 1927, AHQP). Unfortunately, w e do not have Dirac's reply. In between 
Neue Begriindung I and II, Jordan ( 1927el ) published a short paper showing that his 



theory has the desirable feature that the conditional probability of finding a certain 
value for some quantity is independent of the scale used to measure that quantity. 
95 When Kufm complained about the "dreadful notation" of Neue Begriindung I 
in his interview with Jordan for the AHQP project, Jordan said that in Neue 
Begriindung II he just wanted to give a "prettier and clearer" exposition of the 
same material (session 3, p. 17, quoted by Duncan and Janssen, 2009, p. 360). 



61 



amplitudes and for transformation operators with those amplitudes as their 
integral kernels is actually more cumbersome than in Neue Begriindung I. 



In the end, however, these new notational complications only affect the cos- 
metics of the paper. What is more troublesome is that the generalization of 
the formalism promised in the abstract to handle cases with wholly or partly 
discrete spectra is much more problematic than Jordan suggested and, we 
argue, ultimately untenable. By the end of Neue Begriindung II, Jordan is 
counting quantities nobody would think of as canonically conjugate (e.g., dif- 
ferent components of spin) as pairs of conjugate variables and has abandoned 
the notion, central to the formalism of Neue Begriindung I, that any quantity 
of interest (e.g, the Hamiltonian) is a member of a pair of conjugate variables 
connected to some initial pair of p's and g's by a canonical transformation. 
It is fair to say that, although Jordan was still clinging to his p's and g's in 
Neue Begriindung II, they effectively ceased to play any significant role in his 
formalism. 

As we showed at the end of Section 2.3, the canonical transformation 

a = TpT-\ p = TqT- 1 (102) 

(cf. Eqs. (48)-(49)) can never get us from a quantity with a completely con- 
tinuous spectrum (such as position or momentum) to a quantity with a wholly 
or partly discrete spectrum (such as the Hamiltonian). In Neue Begriindung 



II, iJordanl (Il927gl . pp. 16-17) evidently recognized this problem even though 
it is not clear that he realized the extent to which this undercuts his entire 
approach. 

The ce n tral pr oblem is brought out somewhat indirectly in the paper. As 



Jordan! (Il927gl . pp. 1-2) already mentioned in the abstract and then demon- 
strated in the introduction, the commutation relation, [p,q] = H/i, for two 
canonically conjugate quantities p and q cannot hold as soon as the spectrum 
of one of them is partly discrete. Specifically, this means that action-angle vari- 
ables J and w, where the eigenvalues of the action variable J are restricted to 
integral multiples of Planck's constant, cannot satisfy the canonical commu- 
tation relation. 



The proof of this claim is very simple. IJordanl (jl927gi p. 2) considered a pair 



of conjugate quantities a and f3 where (3 is assumed to have a purely discrete 
spectrum. We will see that one runs into the same problem as soon as one a 
or (3 has a single discrete eigenvalue. Suppose a and (3 satisfy the standard 
commutation relation: 

[&J] = i (103) 
As Jordan pointed out, it then follows that an operator that is some function 
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F of /3 satisfie: 



[a,F0)} = -F'0). (104) 

Jordan now chose a function such that F(j3) = for all eigenvalues fa, fa, ■ ■ ■ 
of fa while F'{fa) 7^ at those same points. In that case, the left-hand side of 
Eq. (104) vanishes at all these points, whereas the right-hand side does not. 
Hence, Eq. (104) and, by modus tollens, Eq. (103) cannot hold. 

Since Eq. (104) is an operator equation, we should, strictly speaking, compare 
the results of the left-hand side and the right-hand side acting on some state. 
To show that Eq. (104) — and thereby Eq. (103) — cannot hold, it suffices to 
show that it does not hold for one specific function F and one specific state 
\ip). Consider the simple function F\(j3) — ^ — fa, for which Fi(fa) = and 
F[{fa) = 1, and the discrete (and thus normalizable) eigenstate \fa) of the 
operator corresponding to the quantity fa Clearly, 

(falfoF^faUfa) = (fa\[aj-fa]\fa) = 0, (105) 

as fa fa) = fa\fa), while 

(fa\{U/i)F[0)]\fa) = ^(fa\fa) = -, (106) 

1 1 

as F[0)\fa) = F[{fa)\fa) = \ fa). This shows that the relation, 

and hence Eqs. (103)-(104), cannot hold. The specific example F±(fa = (3 — fa 
that we used above immediately makes it clear that the commutation relation 
[a, fa = h/i cannot hold as soon as either one of the two operators has a single 
discrete eigenvalue. 



Much later in the paper, in sec. 4 ("Canonical transformations"), [Jordan 



(jl927gi . p. 16) acknowledged that it follows directly from this result that no 
canonical transformation can ever get us from a pair of conjugate variables 
p's and g's with completely continuous spectra to d's and fas with partly 



<)(-» 



If the function F(/3) is assumed to be a polyn o mial, c n n , which is all we 



need for what we want to prove, although I Jordan! (|1927eJ . p. 2) considered a "fully 
transcendent function," Jordan's claim is a standard result in elementary quantum 
mechanics: 

[a, F{$)] = [a, > > n f3 n } = > > n n ^ $ n ~ l = ^ [ » n $ n ) = - { F'0), 



n n a P \ n / 



where in the second step we repeatedly used that [a, fa = h/i and that [A, BC] 
[A, B]C + B[A, C] for any three operators A, B, and C. 
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discrete spectra. It is, after all, an essential property of canonical transforma- 
tions that they preserve canonical commutation relations. From Eq. (102) and 
[p, q] = h/i it follows that 

[a J] = \TpT-\TqT~ 1 } = T\p, q\T~ l = h/i. (108) 

Since, as we just saw, only quantities with purely continuous spectra can 
satisfy this commutation relation, Eq. (108) cannot hold for a's and /3's with 
partly discrete spectra and such a's and /3's cannot possibly be obtained from 
p and q through a canonical transformation of the form (102). 

We will discuss below how this obstruction affects Jordan's general formalism. 
When Jordan, in the introduction of Neue Begriindung II, showed that no 
quantity with a partly discrete spectrum can satisfy a canonical commutation 
relation, he presented it not as a serious problem for his formalism but as an 
argument for the super iority of his alte rnative definition of conjugate variables 



in Neue Begriindung I (jJordanl . Il927bl . p. 814, cf. Eq. (4)). In that definition p 
and q are considered canonically conjugate if the probability amplitude (p(p, q) 
has the simple form e~ ipq ^ h , which means that as soon as the value of one of 
the quantities p and q is known, all possible values of the other quantity are 
equiprobable. As we saw in Section 2.1, Jordan showed that for p's and g's 
with purely continuous spectra this implies that they satisfy [p, q] = h/i (cf . 
Eq. (34)), which is the standard definition of what it means for p and q to 



be conjugate variables. In Neue Begriindung II, Jordan ( 1927d . p. 6, Eq. (C)) 



extended his alternative definition to quantities with wholly or partly discrete 
spectra, in which case the new definition, of course, no longer reduces to the 
standard one. 



As iJordanl (Il927d ) wrote in the opening paragraph, his new paper only as- 
sumes a rough familiarity with the considerations of Neue Begriindung I. He 
thus had to redevelop much of the formalism of his earlier paper, while trying 
to both simplify and generalize it at the same time. In sec. 2 ("Basic proper- 
ties of quantities and probability amplitudes"), Jordan began by restating the 
postulates to be satisfied by his probability amplitudes. 

He introduced a new notation for these amplitudes. Instead of ip(/3, q) (cf. note 
58) he now wrote <& ap (f3',q'). The primes, as we explained above, distinguish 
values of quantities from those quantities themselves. The subscripts a and p 
denote which quantities are canonically conjugate to the quantities $ and q for 
which the probability amplitude is being evaluated. As we will see below, one 
has a certain freedom in picking the a and p conjugate to (3 and q, respectively, 
and settling on a specific pair of a and p is equivalent to fixing the phase 
ambiguity of the amplitude <p((3, q) up to some constant factor. So for a given 
choice of a and p, the amplitude § ap ((3', q') is essentially unique. In this way, 



Jordan! (Il927gl . p. 20) could answer, at least formally, von Neumann's (1927a, 



p. 3) objection that probability amplitudes are not uniquely determined even 
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though the resulting probabilities are. It is only made clear toward the end of 
the paper that this is the rationale behind these additional subscripts. Their 
only other role is to remind the reader that <& ap (/3' ,q') is determined not by 
one Schrodinger-type equation in Jordan's formalism but by a pair of such 
equati ons invq l ying b oth canonically conjugate pairs of variables, (p, q) and 
(&, p) (IJordanl . Il927gl p. 20). As none of this is essential to the formalism, we 
will simply continue to use the notation (J3\q) for the probability amplitude 
between the quantities /3 and q. 



Jordan also removed the restriction to systems of one degree of freed om that 
he had adopted for convenience in Neue Begriindungl ( IJordanl . Il927bl . p. 810). 
So q, in general, now stands for (qi, . . . qf), where / is the number of degrees of 
free dom of the syste m under consideration. The same is true for other quanti- 



ties. IJordanl ( 11927gl pp. 4-5) spent a few paragraphs examining the different 
possible structures of the space of eigenvalues for such /-dimensional quanti- 
ties depending on the nature of the spectrum of its various components — fully 
continuous, fully discrete, or combinations of both. He also introduced the 
notation 8 ((3' — (3") for a combination of the Dirac delta function and the Kro- 
necker delta (or, as Jordan called the latter, the "Weierstrassian symbol"). 



In Neue Begriindung II (IJordanl . Il927gl p. 6), the four postulates of Neue 



Begriindung I (see our discussion in Section 2.1) are replaced by three postu 
lates or "axioms," as Jordan now also call ed them, numbered with Roman nu- 
meral s. This may have been in deference to lHilbert. von Neumann, and Nordheim 
( 119281 ). although they listed six such axioms (as we saw in Section 3). Jordan's 
new postulates or axioms do not include the key portion of postulate A of Neue 
Begriindung I stating the probability interpretation of the amplitud es. That 
is rele gated to sec. 5, "The physical meaning of the ampli tudes" (I Jordan . 



Jordan! rtl927gL 



)27gl p. 19). Right before listing the postulates, however 
p. 5) did mention that he will only consider "real (Hermitian) quantities," 
thereby obviating the need for the Ergdnzungsamplitude and simplifying the 
relation between amplitudes and probabilities. There is no discussion of the 
Ergdnzungsamplitude amplitude in the pape r. Instead, following the lead of 
Hilbert. von Neumann, and Nordheim! (119281 ). Jordan silently dropped it. It 
is possible that this was not even a matter of principle for Jordan but only 
one of convenience. Right after listing the postulates, he wrote that the restric- 
tion to real quantities is made only "on account of simplicity" (der Einfachkeit 
halber, ibid., p. 6). 



Other than the probability-interpretation part of postulate A, all four pos- 
tulates of Neue Begriindung I return, generalized from one to / degrees of 
freedom and from quantities with completely continuous spectra to quanti- 
ties with wholly or partly discrete spectra. Axiom I corresponds to the old 
postulate D. It says that for every generalized coordinate there is a conju- 
gate momentum. Axiom II consists of three parts, labeled (A), (B), and (C). 
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Part (A) corresponds to the old postulate B, asserting the symmetry property, 
which, in the new notation, becomes: 



^ ap ((3',q') = ^ pa ( q ',(3' 



(109) 



Part (B) corresponds to the old postulate C, which gives the basic rule for the 
composition of probability amplitudes, 



J2 ®ap(P', q')%p{q', Q') = *«pO0', Q') 



(110) 



where the notation J2q indicates that, in general, we need a combination of 
integrals over the continuous parts of the spectrum of a quantity and sums 
over its discrete parts. In Eq. (110), J2q refers to an ordinary integral as the 
coordinate q has a purely continuous spectrum. Adopting this notation, we 
can rewrite the composition rule (110) in the modern language introduced in 
Section 2 and immediately recognize it as a completeness relation (cf. Eqs. (1) 
and (3)): 

J2({3\q)(q\Q) = (P\Q)- (HI) 



We can likewise formulate orthogonality relations, as I Jordan! (jl927gi . p. 7, Eq. 
(5)) did at the beginning of sec. 2 ("Consequences"): 



Y J W)W) = K<i 



(112) 



Recall that S(j3 — /?') can be either the Dirac delta function or the Kronecker 
delta, as f3 can have either a fully continuous or a partly or wholly discrete 
spectrum. The relations in Eq. (112) can, of course, also be read as complete- 
ness relations, i.e., as giving two different resolutions, 



&><9l> EI/WI, 

1 



(113) 



of the unit operator. Part (C) of axiom II is the generalization of the definition 
of conjugate variables familiar from Neue Begriindung I to / degrees of freedom 
and to quantities with wholly or partly discrete spectra. Two quantities at = 
(ax, •••,«/) and (3 = (fix, ... , fif) are canonically conjugate to one another if 



a) = Ce 



ELx^k)/n 



(114) 



wher e C is a normalization constant] 97 1 Axiom HI, finally, is essentially axiom 
III of lHilbert. von Neumann, and Nordheiml (119281 . p. 4), which was not part 



97 Contrary to what I Jordanl (1927jJ, p. 7) suggested, the sign of the exponent in Eq. 
(114) agrees with the sign of the exponent in the corresponding formula in Neue 



Begriindung I ()Jordanl . Il927bl . p. 814, Eq. 18; our Eq. (4)). 
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of Neue Begriindung I and which says, in our notation, that (/3\/3') = 6(/3 — /3'), 
where, once again, 5(/3 — /3') can be either the Dirac delta function or the 
Kronecker delta. 



We need to explain one more aspect of Jordan's notation in Neue Begriindung 
II. As we have seen in Sections 2 and 3, probabili ty ampl i tudes do double 
duty as integral kernels of canonical transformations. iJordanl (Il927gl . p. 6, Eqs. 
(A')-(B / )) introduced the special notation $^ to indicate that the amplitude 
$ap(/3', q') serves as such an integral kernel, thinking of $^ as 'matrices' with 
f3 and q as 'indices' that, in general, will take on both discrete and continuous 
values. We will continue to use the modern notation (/3\q) both when we want 
to think of this quantity as a probability amplitude and when we want to 
think of it as a transformation 'matrix'. The notation of Neue Begriindung II, 
like the modern notation, clearly brings out the double role of this quantity. 
In Neue Begriindung I, we frequently encountered canonical transformations 
such as a = TpT' 1 , /3 = TpT~ l (cf. Eqs. (48)-(49)). In Neue Begriindung 
II, such transformations are written with $^'s instead of T's. As we will 
explain in detail below, this conceals an important shift in Jordan's usage of 
such equations. This shift is only made explicit in sec. 4, which, as its title 
announces, deals specifically with "Canonical transformations." Up to that 
point, and especially in sec. 3, "The functional equations of the amplitudes," 
Jordan appears to be vacillating between two different interpretations of these 
canonical transformation equations, the one of Neue Begriindung I in which a 
and /3 are new conju gate yariable s di fferent f r om the p and q we started from, 
and one, inspired by iDirad (119271 ). as I Jordan! ( Il927gl pp. 16-17) acknowledged 
in sec. 4, in which a and (5 are still the same p and q but expressed with respect 
to a new basis, 



9S 



Before he got into any of this, IJordanl ( 11927gl sec. 2, pp. 8-10) examined five 
examples, labeled (a) through (e), of what he considered to be pairs of conju- 
gate quantities and convinced himself that they indeed qualify as such under 
his new definition (114). More specifically, he checked in these five cases that 
these purportedly conjugate pairs of quantities satisfy the completeness or 
orthogonality relations (112). The examples include familiar pairs of canon- 
ically conjugate variables, such as action-angle variables (Jordan's example 
(c)), but also quantities that we normally would not think of as conjugate 
variables, such as different components of spin (a special case of example (e)). 
We take a closer look at these two specific examples. 



In example (c), the allegedly conjugate variables are the angle variable w with 
a purely continuous spectrum and eigenvalues w G [0, 1] (which means that 
the eigenvalues of a true angle variable d = 2irw are d G [0, 2ir}) and the action 
variable J with a purely discrete spectrum and eigenvalues J = C + nh, where 



as 



Cf. the letter from Dirac to Jordan quoted in Section 1.1 (cf. note 42). 
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C is an arbitrary (real) constant and n is a positive integer. For convenience we 
set C = — 1, so that J = mh with m — 0, 1, 2, . . . The probability amplitude 
(w\ J) has the form required by Jordan's definition (114) of conjugate variables, 
with a = J, (3 = w, and / = 1: 

(w\J) =e iwJ/h . (115) 
We now need to check whether (w\J) satisfies the two relations in Eq. (112): 



dw (J ni \w) (w\J ri2 ) = 5 nin2 , ( w \ J n){Jn\ w ') = S( w ~ w ')- ( 116 ) 

n=0 

Using Eq. (115), we can write the integral in the first of these equations as: 
dw (w\J n2 )(w\J ni )* = C dwe 2 ^~ n ^ w = 5 nin2 . (117) 



Hence the first relation indeed holds. We can similarly write the sum in the 
second relation as 

oo oo 

£ (w\J n )(w'\J n )* = £ e 2 ™(— '). (118) 

n=0 n=0 

Jordan set this equal to 5(w — w'). However, for this to be true the sum over 
n should have been from minus to plus infinityP^I If the action-angle vari- 
ables are (L z ,0), the ^-component of angular momentum and the azimuthal 
angle, the eigenvalues of J are, in fact, ±mh with m — 0, 1, 2, . . ., but if the 
action variable is propor tional to the energy, as it is in many applications in 
the old quantum theory (IDuncan and Janssenl . 120071 . Pt. 2, pp. 628-629), the 
spectrum is bounded below. So even under Jordan's alternative definition of 
canonically conjugate quantities, action-angle variables do not always qualify. 
However, since action-angle variables do not play a central role in the Neue 
Begriindung papers, this is a relatively minor problem. 



99 A quick way to see this is to consider the Fourier expansion of some periodic 
function f(w): 

oo „! 

f(w)= V c n e 27Tinw , with Fourier coefficients c n = / dw' e~ 2ninw ' f(w'). 

Substituting the expression for c n back into the Fourier expansion of f(w), we find 

f(w) = £ dw' ^ f; e^(«-« j f( w >), 

which means that the expression in parentheses must be equal to 5(w — w'). Note 
that the summation index does not run from to oo, as in Eq. (118), but from — oo 
to +oo. 
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We turn to Jordan's example (e), the other example of supposedly conjugate 
quantities that we want to examine. Consider a quantity /3 with a purely dis- 
crete spectrum with N eigenvalues 0, 1,2,..., AT — 1. Jordan showed that, if 
the completeness or orthogonality relations (112) are to be satisfied, the quan- 
tity a conjugate to (3 must also have a discrete spectrum with N eigenvalues 
hk/N wher e k = 0, 1 , 2, . . . , N — 1. We will check this for the special case that 



iV = 2. As iJordanl ( 11927gl pp. 9-10) noted, this corresponds to the case of 



electron spin. In sec. 6, "On the theory of the m agnetic electro n," he returned 



to the topic of spin, acknowledging a paper by iPaulil (|l927bh on the "mag- 
netic electron," which he had read in manuscript (IJordanl . Il927gi p. 21, note 
2). Pauli's discussion of spin had, in fact, been an important factor prompting 
Jordan to write Neue Begriindung II: 

But the magnetic electron truly provides where the older canonical 

commutation relations completely fail; the desire to fully understand the 
relations encounte red in this case was an important reason for carrying out 



this investigation ( IJordanl . Il927gl p. 22). 



The two conditions on the amplitudes (/3\a) = Ce % ^ a ^ h (Eq. (114) for / = 1) 
that need to be verified in this case are (cf. Eq. (112)): 

2 2 

YK a m\Pk)(/3k\a n ) = <W, Y^(P™\ a k)( a k\Pn) = Km- (119) 
k=l k=l 

Inserting the expression for the amplitudes, we can write the first of these 
relations as 

E(AoK)(AoK>* = £<7V/w— »)/» = c 2 (i + e *K-a m )A) ) (120) 

k=l k=l 

where in the last step we used that fa = and fa — 1. The eigenvalues of a in 
this case are aii = and «2 = h/2. For m = n, the right-hand side of Eq. (120) 
is equal to 2C 2 . For m ^ n, e < a "- a ^)/ h = e ± l7T = _1 an d the right-hand side 
of Eq. (120) vanishes. Setting C = l/y/2, we thus establish the first relation 
in Eq. (119). A completely analogous argument establishes the second. 

This example is directly applicable to the treatment of two arbitrary compo- 
nents of the electron spin, a = (a x , a y ,a z ), say the x- and the z-components, 
even though the eigenvalues of both a x and a z are (■=, —~) rather than (0, h/2) 
for a and (0, 1) for /3 as in Jordan's example (e) for N = 2. We can easily 
replace the pair of spin components (a x , a z ) by a pair of quantities (a, (3) that 
do have the exact same eigenvalues as in Jordan's example: 

h ( 1\ a 1 / \ 

The amplitudes ((3\a) = (1/V2) e ipa ' n now express that if the spin in one direc- 
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tion is known, the two possible values of the spin in the directions orthogonal 
to that direction are equiprobable. Moreover, as we just saw, amplitudes {/3\a) 
satisfy the completeness or orthogonality relations (112). It follows that any 
two orthogonal components of spin are canonically conjugate to one another 
on Jordan's new definition] One can thus legitimately wonder whether this 
definition is not getting much too permissive. However, as we will now show, 
the main problem with Jordan's formalism is not that it is asking too little of 
its conjugate variables, but rather that it is asking too much of its canonical 
t r ansformat ions ! 



Canonical transformations enter into the formalism in sec. 3, where Jordan 



(Il927gl . pp. 13-16) introduced a simplified yet at the same time generalized 
yersion o f equa tions (2ab) of Neue Begriindung I for probability amplitudes 
(Jordan, 1927bl . p. 821). They are simplified in that there are no longer ad- 



ditional equations for the Erg anzung Samplitude (ibid., Eqs. (3ab)). They are 
generalized in that they are no longer restricted to systems with only one 
degree of freedom and, much more importantly, in that they are no longer re- 
stricted to cases where all quantities involved have purely continuous spectra. 
Quantities with partly or wholly discrete spectra are now also allowed. 

Recall how Jordan built up his theory in Neue Begriindung I (cf. our discussion 
in Section 2.3). He posited a number of axioms to be satisfied by his probability 
amplitudes. He then constructed a model for these postulates. To this end he 
identified probability amplitudes with the integral kernels for certain canonical 
transformations. Starting with differential equations trivially satisfied by the 
amplitude {p\q) = e~ tpq l h for some initial pair of conjugate variables p and q, 
Jordan derived differential equations for amplitudes involving other quantities 
related to the initial ones through canonical transformations. As we already 
saw above, this approach breaks down as soon as we ask about the probability 
amplitudes for quantities with partly discrete spectra, such as, typically, the 
Hamiltonian. 



Although I Jordan! (Il927gl . p. 14) emphasized that one has to choose initial p's 
and g's with "fitting spectra" (passende Spektren) and that the equations for 
the amplitudes are solvable only "if it is possible to find" such spectra, he 
did not state explicitly in sec. 3 that the cons truction of Neue Begriindung 



I fails for quantities with discrete spectra ] 100 1 That admission is postponed 
until the discussion of canonical transformations in sec. 4. At the beginning of 
sec. 3, the general equations for probability amplitudes are given in the form 



i00[ n Yi[ s Mathematische Begriindung, von Neumann ( 1927al ) had already put his 



finger on this problem: "A special difficulty with [the approach of] Jordan is that 
one has to calculate not just the transforming operators (the integral kernels of 
which are the "probability amplitudes"), but also the value-range onto which one 
is transforming (i.e., the spectrum of eigenvalues)" (p. 3). 
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flJordanl . Il927gi p. 14, Eqs. (2ab)): 



ap 



- & k Q% = 0, 

where A k and B k are defined as [NB2, sec. 3, Eq. (1)]: 



(122) 
(123) 



(124) 



Jordan! fjl927d . pp. 14-15) then showed that the differential equations of Neue 
Begriindung I are included in these new equations as a special case. Since there 
is only one degree of freedom in that case, we do not need the index k. We can 
also suppress all indices of as this is the only amplitude/transformation- 
matrix involved in the argument. So we have A = and B = 
These transformations, however, are used very differently in the two install- 
ments of Neue Begriindung. Although Jordan only discussed this change in 
sec. 4, he already alerted the reader to i t in sec. 3, noti ng that U B, A are the 
operators for (3, a with respect to q, p" ( [Jordan! Il927gi . p. 15) 



Suppressing all subscripts and superscripts, we can rewrite Eqs. (122)-(123) 
as: 

($ B - j3) $ = 0, (125) 



Using that 



a 



f(p,q), $5$ 1 = P = g(p,q) 



(126) 



(127) 



([Jordan! Il927gi . p. 15, Eq. 8); that p and q in the g-basis are represented by 
(h/i)d/dq and multiplication by q, respectively; and that a and $ in Eqs. 
(125)-(126) are represented by —{h/i)d/d(3 and multiplication by 0, respec- 
tively, we see that in this special case Eqs. (122)-(123) (or, equivalently, Eqs. 
(125)-(126)) reduce to [NB2, p. 15, Eqs. (9ab)] 



' (hd_ N 

y 9 \idq q y 



P $ = 0, 



(128) 




0. 



(129) 



which are just Eqs. (2a) of Neue Begriindung I (jJordanl . Il927bl . p. 821; cf. 
Eqs. (50)-(51) with (q\P) written as $). This is the basis for Jordan's renewed 
claim that his general equations for probability amplitudes contain both the 
time-dependent and the time-independent Schrodinger equations as a special 
case (cf. our discussion at the end of Section 2.3). It is certainly true that, 
if the quantity B in Eq. (124) is chosen to be the Hamiltonian, Eq. (128) 
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turns into the time-independent Schrodinger equation. However, there is no 
canonical transformation that connects this equation for ip n (q) = (q\E) to the 
equations trivially satisfied by (p\q) that formed the s tart ing point for Jordan's 
construction of his formalism in Neue Begriindung i j 101 1 



Jordan! (jl927gl . pp . 16-17) fina lly conceded this point in sec. 4 of Neue Begriin- 
dung II. Following iDirad (119271 ). Jordan switched to a new conception of canon- 
ical transformations. Whereas before, he saw canonical transformations such 
as a = TpT~ l ,(3 = TqT~ l , as taking us from one pair of conjugate vari- 
ables (p, q) to a different pair (a,j3), he now saw them as taking us from one 
particular representation of a pair of c onjugate variables to a different repre- 
sentation of those same variables. 102 The canonical transformation used in 



sec. 3, A = 4> _1 a<h, B = 4> _1 /3<I>, is already an example of a canonical trans- 
formation in the new Dirac sense. By giving up on canonical transformations 
in the older sense, Jordan effectively abandoned the basic architecture of the 
formalism of Neue Begriindung I. 



This is how Jordan explained the problem at the beginning of sec. 4 of Neue 
Begriindung II: 

Canonical transformations, the theory of which, as in classical mechanics, 
gives the natural generalization and the fundamental solution of the problem 
of the integration of the equati o ns of motion, were originally [footnote citing 
Born. Heisenberg. and Jordan! fll926l )] conceived of as follows: the canonical 
quantities q, p should be represented as functions of certain other canonical 
quantities (3, a: 

q k = G k 0,a), p k = F k 0,a). (1) 



On the assumption that canonical systems can be defined through the usual 
canonical c ommuta t ion rel ations, a formal proof could be given [footnote 
referring to IJordanl (ll926al )] that for canonical q, p and j3, a equations (1), 
as was already suspected originally, can always be cast in the form 



q k = T(3 k T \ p k = Ta k T~ 



(2) 



However, since, as we saw, the old canonical commutation relations are not 
valid [cf. Eqs. (103)-(107)], this proof too loses its meaning; in general, one 
can not bring equations (1) in the form (2). 

Now a modified conce ption of cano nical transformation was developed 
by Dirac [footnote citing IDirad ( 119271 ) and lLanczoa ( 119261 )] . According to 



101 This probl em does not affect Dirac's version of the theory (see our discussion 
toward the end of Section 1.1). 

102 Cf. the passage from the letter from Dirac to Jordan of December 24, 1916, that 
we quoted toward the end of Section 1.1 (cf. note 42). 
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Dirac, [canonical transformations] are not about representing certain canon- 
ical quantities as functions of other canonical quantities, but rather about 
switching, without a transfor mation of the qu antities themselves, to a dif- 



ferent matrix representation (j Jordan! . Il927gl . pp. 16-17; emphasis in the 
original, hats added). 

In modern terms, canonical transformations in the new Dirac sense transform 
the matrix elements of an operator in one basis to matrix elements of that 
same operator in another. This works whether or not the operator under con- 
sideration is part of a pair of operators corresponding to canonically conjugate 
quantities. The notation $^ = (f3\q) introduced in Neue Begriindung II that 
replaces the notation T in Neue Begriindung I for the operators implementing 
a canonical transformation nicely prepared us for this new way of interpreting 
such transformations. Consider the matrix elements of the position operator 
q (with a purely continuous spectrum) in the g-basis: 

(q'\q\q") = q'8(q' -q")- (130) 

Now let (3 be an arbitrary self-adjoint operator. In general, $ will have a 
spectrum with both continuous and discrete parts. Von Neumann's spectral 
theorem tells us that 

P = "£f3 n \f3 n )(f3 n \ + [ P \P) (P\ d/3, (131) 
n J 

where sums and integrals extend over the discrete and continuous parts of the 
spectrum of (3, respectively. In Jordan's notation, the spectral decomposition 
of (3 can be written more compactly as: (3 = (3 \/3}{/3\. We now want to find 
the relation between the matrix elements of q in the /3-basis and its matrix 
elements in the g-basis. Using the spectral decomposition / q \q)(q\ dq of q, we 
can write: 



(f3'\q\n= I dq'iP'WWWm 

' dq'dq" W)q"S(q' - q")(q"W) (132) 
dq'dq" ((3'\q")(q"\q\q')(q"\(3") 



In the notation of Neue Begriindung II, the last line would be the 'matrix mul- 
tiplication', q^qa- This translation into modern notation shows that 
Jordan's formalism, even with a greatly reduced role fo r canonical transforma- 



tions, implicitly relies on the spectral theorem, which Ivon Neumann! ( 11927a! ) 
published in Mathematische Begriindung, submitted just one month before 
Neue Begriindung II. We also note, however, that an explicit choice of quan- 
tities p and a conjugate to q and (3, respectively, is completely irrelevant for 
the application of the spectral theorem. 
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This gets us to the last aspect of Neue Begriindung II that we want to discuss 
in this section, namely Jordan's response to von Neumann's criticism of the 
Dirac- Jordan transformation theory. A point of criticism we already mentioned 
is that the probability amplitude <p(/3,q) = (j3\q) is determined only up to a 
phase factor. 

As we mentioned in Section 1.3, the projection operator \a){a\ does not change 
if the ket \a) is replaced by e l ^\a) and the bra (a\ accordingly by e~ l ^(a\, where 
$ can be an arbitrary real function of a. Hence, the spectral decomposition of 
(3 in Eq. (131) does not change if \(3) is replaced by e~ tp ^^ h \f3) , where we have 
written the phase factor, in which p is an arbitrary real function of /3, in a way 
that corresponds to Jordan's notation for the resulting phase ambiguity in the 
amplitude (p(/3,q) = (f3\q). Similarly, the spectral decomposition of q does not 
change if we replace \q) by e la ^ q ^ h \q), where a is an arbitrary real function 
of q. However, with the changes \0 ) — > e~ lp ^^ h \(3 ) and \q) — > e la ^ q ^ h \q), the 



amplitude (p(/3,q) = (/3\q) changes (jJordanl . Il927gi . p. 20): 



ifi\q) — ► e iW)+ ^ q))/h {P\q). (133) 

Unlike projection operators, as von Neumann pointed out, probability ampli- 
tudes, are determined only up to such phase factors. 

As we noted above, Jordan responded to this criticism by adding a dependence 
on quantities a and p conjugate to /3 and q, respectively, to the probability am- 
plitude ((3\q), thus arriving at the amplitudes § ap ((3',q') of Neue Begriindung 
II. It turns out that the phase ambiguity of (/3\q) is equivalent to a certain 
freedom we have in the definition of the quantities a and p conjugate to (3 
and q, respectively. The phase ambiguity, as we saw, can be characterized by 
the arbitrary functions p(/3) and cr(q). Following Jordan, we will show that 
our freedom in the definition of at and p is determined by the derivatives p'{f3) 
and cr'(g) of those same functions. By considering amplitudes $> ap ((3',q') with 
uniquely determined at and p, Jordan could thus eliminate the phase ambiguity 
that von Neumann found so objectionable. 



Following iJordanl ( 1l927gl p. 20), we establish the relation between these two 



elements of arbitrariness for the special case that al l quanti ties involved have 



fully continuous spectra. In sec. 6 on spin, IJordanl ( 11927gl pp. 21-25) tried 
to extend his argument to some special cases of discrete spectra. We will not 
discuss those efforts. 

Consider two complete sets of eigenstates of q, {\q)i} and { | g) 2} , related to 
one another via 

\q)i = e^' n \q) 2 . (134) 
This translates into two different amplitudes, (pi((3, q) = (P\q)i and <f>2(P, q) 
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(P\q)2, that differ by that same phase factor: <pi(f3,q) = e to w/ R <p-,(3. q) \ 103 
Suppose pi is conjugate to g if we use the \q)i set of eigenstates of q. Since we 
restrict ourselves to quantities with fully continuous spectra, this means that 
[Pi><z] — h/i- It also means, as we saw in Section 2.1, that 

Pi\q)i = --%)i- (135) 

i oq 

Eigen state s \p)\ of pi can be written as a Fourier series in terms of the \q)\ 
states 
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qe^lq}^ (136) 

We can likewise construct a.p 2 conjugate to q if we use the \q)2 set of eigenstates 
of q. Instead of Eqs. (135)-(136), we then have 

P2\q)2 = - j^kK \Ph = J dqe^ h \q) 2 . (137) 
The relation between these two different conjugate momenta, it turns out, is 

P2=Pi + v\q). (138) 

Note that the commutator [pi, q] does not change if we add an arbitrary func- 
tion of q to pi. To prove that Eq. (138) indeed gives the relation between pi 
and p2, we show that \p) 2 in Eq. (137) is an eigenstate of p 2 as defined in Eq. 
(138), using relation (134) between \q) 1 and \q) 2 : 



103 An argument completely analogous to one we give for the relation between the 
phase factor e t<J ^ q ^ h and the definition of p can be be given for the relation between 
the phase factor e tp ^^ h and the definition of d. 

104 One easily verifies that \p) i is indeed an eigenstate of p\ . The action of p\ on \p) \ 
can be written as: 




Partial integration gives: 

Pi|p>i = / dq-^ (e^ h ) \q) 1 =p J dq^ h \q)i = p\p) lt 
which is what we wanted to prove. 
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p 2 \p) 2 = I dqe^' h p 2 \q) 2 

= J dqe rpq/h (p! + a'(q)) e^ a{q)/h \q)i 

= Jdq(p- a\q) + a'{q)) e ^-^' h \q) 1 
= pj dqe l ^ h \q) 2 =p\p) 2 , 



where in the fourth step we performed a partial integration. This proves that 
the ambiguity (138) in the p conjugate to q corresponds directly to the phase 
ambiguity (134) in the amplitude (j3\q). Similarly, the ambiguity in the a 
conjugate to (3, which is determined only up to a term p'(f3), corresponds 
directly to the phase ambiguity e %p ^l h in the amplitude ((3\q). Hence, for 
specific conjugate variables p and a conjugate to q and (3, the amplitude 
$ ap (f3', q') of Neue Begriindung II is unique up to a constant phase factor (i.e., 
one that is not a function of q or /3). 



In addi tion to responding to von Neumann's criticism of his approach, iJordan 
(Il927gi . p. 20) also offered some criticism of von Neumann's approach. In par- 
ticular, he complained that von Neumann showed no interest in either canon- 
ical transformations or conjugate variables. As we will see when we cover 
von Neumann's Mathematische Begriindung in the next section, this is sim- 
ply because von Neuman did not need either for his formulation of quantum 
mechanics. That formulation clearly did not convince Jordan. In fact, von 
Neumann's paper only seems to have increased Jordan's confidence in his own 
approach. After his brief discussion of Mathematische Begriindung, he con- 
cluded: "It thus appears that the amplitudes the mselves are to b e considered 
the fundamental concept of quantum mechanics" fl Jordan! . Il927gl . pp. 20-21). 



It is unclear whether Jordan ever came to appreciate the advantages of von 
Neumann's approach over his own. In the preface of his texbook on quantum 



mechanics, Anschauliche Quantentheorie, IJordan! (119361 ) described the statis- 
tical transformation theory of Dirac and himself as "the pinnacle of the devel- 
opment of quantum mechanics" (p. VI) and as the "most comprehensive and 
profound version of the quantum laws" (ibid., p. 171; quoted by Duncan and 
Janssen, 2009, p. 361). He did not discuss any of von Neumann's contributions 
in this book. However, in the textbook he co-authore d with Born, Elementare 



Quantenmechanik, we do find elaborate ex positions (IBorn and Jordan! . Il930l . 
Ch. 6, pp. 288-364) of the two papers by von Neumann ( 1927a , bl) that will 
occupy us in the next two sections. 
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5 Von Neumann's Mathematische Begriindung (May 1927) 



In the next two sections we turn our attention to the first two papers of the 
trilogy that von Neumann ( 1927a b. cj) pu blish ed the same ye ar as and partly 
in response to the papers by iDirad (119271 ) and iJordanl (jl927bl ) on transforma- 
tion theory. Th is trilogy provided th e backbone of his famous book published 
five years later ( Ivon Neumann! . Il932l ). The first paper in the trilogy, Mathema- 
tische Begriindung, was presented in the meeting of the Gottingen Academy of 
May 20, 1927. In this paper, von Neumann first introduced the Hilbert space 
formalism and the spectral theorem, at least for bounded operators, two con- 
tributions that have sin ce be come staples of graduate texts in quantum physics 
and functional analysis ] 105 1 In part because of this greater familiarity but also 
because of its intrinsic clarity, von Neumann's Mathematische Begriindung is 
much easier to follow for modern readers than Jordan's Neue Begriindung. 
There is no need for us to cover it in as much detail as we did with Jordan's 
papers in Sections 2 and 4 
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Mathematische Begriindung is divided into nine parts, comprising 15 sections 
and two appendices: 

(1) "Introduction," sec. I, pp. 1-4; 

(2) "The Hilbert space," sees. II- VI, pp. 4-22; 

(3) "Operator calculus," sees. VII- VIII, pp. 22-29; 

(4) "The eigenvalue problem," sec. IX-X, pp. 29-37; 

(5) "The absolute value of an operator," sec. "IX" (a typo: this should be 
XI), pp. 37-41; 

(6) "The statistical assumption [ylnsafe] of quantum mechanics," sees. XII- 
XIII, pp. 42-47; 

(7) "Applications," sec. XIV, pp. 47-50; 

(8) "Summary," sec. XV, pp. 50-51; 

(9) "Appendices," pp. 51-57. 

Abstract Hilbert space is introduced in sees. V-VI, the spectral theorem in 
sees. IX-X. After going over the introduction of the paper, we focus on parts 
of sec. IV and sees. IX-XIII. 



In Sec. IV, von Neumann criticized the way in which wave mechanics and ma- 
trix mechanics are unified in the approach of Dirac and Jordan and presented 
his superior alternative approach to this unification, based on the isomor- 
phism of two concrete instantiations of abstract Hilbert space the space of 



105 m the latter cate g ory we already mentioned Prugovecki ( 198ll ) and 



Dennerv and Krzvwickil (|1996l . C h. 3) (see note 57). 



106 rhis p a p er is a i so discussed by iMehra and Rechenbergl (12000-20011 . Ch. Ill, sec. 
2(d), pp. 411-417). 
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square-summab l e sequ ences % and the space of square-integrable functions 
( Ivon Neumann! . Il927ai . the designations £>, %, and are introduced on pp. 
14-15). In modern notation, this is the isomorphism between I 2 and L 2 . 

Sees. IX-XIII contain von Neumann's criticism of Jordan's use of probability 
amplitudes and his derivation of an alternative formula for conditional prob- 
abilities in quantum mechanics in terms of projection operators. Unlike von 
Neumann, we present this derivation in Dirac notation. 



In the introduction of Mathematische Begriindung, von Neumann ( 1927al . pp. 
1-3) gave a list of seven points, labeled a through d (there is no point 77), in 
which he took stock of the current state of affairs in the new quantum theory 
and identified areas where it ran into mathematical difficulties. We paraphrase 
these points, (a) Quantum theory describes the behavior of atomic systems 
in terms of certain eigenvalue problems. ((3) This allows for a unified treat- 
ment of continuous and discontinuous elements in the atom ic world. (7) The 
theory suggests that the laws of nature are stochastic J 107 1 (5) Returning to 
the formulation of the theory in terms of eigenvalue problems, von Neumann 
briefly characterized the different but equivalent ways in which such prob- 
lems are posed in matrix mechanics and in wave mechanics. Both approaches 
have their difficulties, (e) The application of matrix mechanics appears to be 
restricted to situations with purely discrete spectra. To deal with wholly or 
partly continuous spectra, one ends up using, side by side, matrices with in- 
dices taking on discrete values and "continuous matrices," i.e., the integral 
kernels of the Dirac- Jordan transfor mation theory, with 'i ndices' taking on 



continuous values. It is "very hard," Ivon Neumann! (I1927al . p. 2) warned, to 



do this in a mathematically rigorous way. (() These same problems start to 
plague wave mechanics as soon as wave functions are interpreted as probability 
amplitudes. Von Neumann credited Born, Pauli, and Jordan with transferring 
the probability concepts of matrix mechanics to wave mec hanic s and Jordan 
with developing these ideas into a "closed system" (ibid.) J 108 1 This system, 
however, faces serious mathematical objections because of the unavoidable 



107 Parenthetically, Ivon Neumann! (|l927al . p. 1) added an important qualification, 
"(at least the quantum laws known to us)," leaving open the possibility that, at a 
deeper level, the laws would be deterministic again. Von Neumann's position at this 
point was thus basically the same as Jordan's (see note 40). By the time of the second 
paper of his trilogy, von Neumann had read Heisenberg's (1927b) uncertainty paper 
and endorsed Heisenberg's position that the indeterminism of quantum mechanics 
i s the result of t he inev itable disturbance of quantum systems in our measurements 
(|von Neumannl . Il927bl. p. 273; cf . note 1 37). 



108 In this context von Neumannl ( 1927a . p. 2) referred to his forthcoming paper 



with Hilbert and Nordheim (1928). Oddly, von Neumann did not mention Dirac 
at t his point, although Di rac is mentioned (alongside Pauli and Jordan) in sec. 
XII (von Neumann . 1927al . p. 43) as well as in the second paper of the trilogy 
( von Neumann . 1927bl . p~245; see Section 6). 
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use of improper eigenfunctions, such as the Dirac delta function, the proper- 
ties of which von Neumann thought were simply "absurd" (ibid., p. 3). His 
final objection seems mild by comparison but weighed heavily for von Neu- 
mann: ($) eigenfunctions in wave mechanics and probability amplitudes in 
transformation theory are determined only up to an arbitrary phase factor. 
The probabilities one ultimately is after in quantum theory do not depend 
on these phas e factors and von Neumann therefore wanted to avoid them 
altogether 
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In sec. II, von Neumann set the different guises in which the eigenvalue prob- 
lems appear in matrix and in wave mechanics side by side. In matrix me- 
chanics, the problem is to find square-summable infinite sequences of complex 
numbers v = (vi, i>2, . . .) such that 

Hv = Ev, (140) 

where H is the matrix representing the Hamiltonian of the system in matrix 
mechanics and E is an energy eigenvalue. In wave mechanics, the problem is 
to find square- integrable complex- valued functions f(x) such that 

Hf(x)=Ef(x), (141) 

where H is the differential operator, involving multiplication by x and differ- 
entiation with respect to x, that represents the Hamiltonian of the system in 
wave mechanics. 



One way to unify these two approaches, Ivon Neumann! (Il927al . pp. 10-11) 
pointed out at the beginning of sec. IV, is to look upon the discrete set of 
values 1,2,3... of the index % of the sequences \vi\ c *L x in matrix mechan- 
ics and the continuous (generally mult i- dimensional) domain Q of the func- 
tions f{x) in wave mechanics as two particular realizations of some more 
genera l space, which yon N eumann called R. Following the notation of his 
book ( von Neumann! . 1932 . sec. 4, pp. 1 5—16 ), we call the 'space' of index 



values Z. Eq. (140) can then be written as 



no 



]T H ijVj = E Vi . (142) 

'Summation over Z J can be seen as one instantiation of 'integration over i?;' 
'integration over f2' as another. In this way Eq. (141) can, at least formally, 



109 [n Neue Begrundung II, as we saw in Section 5, Jordanl ( 1927g . p. 8) responded to 



this criticism by adding subscripts to the probability amplitudes for two quantities 
$ and q indicating a specific choice of the canonically-conjugate quantities a and p 
(see Eqs. (133)-(139)). 

110 YVe replaced von Neumann's (1927a, p. 10) Xj's by v^s to avoid confusion with 
the argument (s) of the functions f(x). 
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be subsumed under matrix mechanics. One could represent the operator H in 
Eq. (141) by the integral kernel H(x,y) and write 



dyH(x,y)f(y) = Ef(x) 



(143) 



Both the matrix Hij and the integral kernel H(x,y) can be seen as 'matrices' 
H xy with indices x,y G R. For H^, R = Z; for H(x, y), R = Q. Von Neumann 
identified this way of trying to unify matrix and wave mechanics as Dirac's 
way (and, one may add, although he is not mentioned by name at this point: 
Jordan's way). Von Neumann rejected this approach. He dismissed the analogy 
between Z and Q sketched above as "very s uperficial, as long as o ne stick s to 
the usual measure of mathematical rigor" (jvon Neumann! . Il927al . p. 11) 
He pointed out that even the simplest linear operator, the identity operator, 
does not have a proper integral-kernel representation. Its integral kernel is the 
improper Dirac delta function: / dy S(x — y)f(y) = f(x). 



The appropiate analogy. I von Neumann! (11927al . pp. 11-14) argued, is not be- 
tween Z and Q, but between the space of square- summable se quences over Z 
and th e space of square-integrable functions overQ. In his book . Ivon Neumann 



(119321 . p. 16) used the notation Fz and Fq for these two spaces ] 112 1 In 1927, as 
mentioned abov e, he used £>o and £>, instead. Today they are called I 2 and L 2 , 
respectively ! 113 1 Von Neumann (1927a, pp. 12-13) reminded his readers of the 
"Parseval formula," which maps sequences in I 2 onto functions in L 2 , and a 
"the orem of Fischer and F. Riesz," which maps functions in L 2 onto sequences 
in l 2 \ 114 l The c ombination of these tw o results establishes that I 2 and L 2 are 
isomorphic. As Ivon Neumann! (|1927al p. 12) emphasized, these "mathemati- 
cal facts that had long been known" could be used to unify matrix mechanics 
and wave mechanics in a mathematically impeccable manner. With a stroke 
of the pen, von Neumann thus definitively settled the issue of the equivalence 
of wave mechanics and matrix mechanics. Anything that can be done in wave 



m In Section 1.3, we already quoted some passages from the introduction of von 
Neumann's 1932 book in which he complained about the lack of mathematical rigor 
in Dirac's approach. After characterizing the approach in terms of the analogy 
between Z and $7, he wrote: "It is no wonder that this cannot succeed without 
some violence to formalism and mathematics: the spaces Z and Q are really very 
different and every attempt to est ablish a relation between them must run into great 
di fficulties' ' ( von Neumannl . 19321 . p. 15). 



ii 



•Jammer! (119661. pp . 314-315) also used this 1932 notation in his discussion of 
von Neumann ( 1927a! ) . 



von 



Neumannl ( 1927a . p. 7) remarked that what 



we now 



113 Earlier in his paper, 
call I 2 w as usually called "(complex) Hilbert space." Recall, however, that lLondon 
(jl926bl . p. 197) used t he term "Hilber t space " for L 2 (note 56). 
L14 The paper cited by Ivon Neumannl (Il927al. p. 13, not e 15) is iRiesa (Il907af). In 



his dis cussion of von Neum ann's paper, Jammer (jl966l . pp. 314-315) cited Riessl 
(Il907al lbh and lFischerl (119071 ). 
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mechanics, i.e., in L 2 , has a precise equivalent in matrix mechanics, i.e., in 
l 2 . This is true regardless of whether we are dealing with discrete spectra, 
continuous spectra, or a combination of the two. 



In sec. V. lvon Neumannl (jl927al . pp. 14-18) introduced abstract Hilbert space, 
for which he used the nota tion £>, carefully defining it in terms of five axioms 
labeled A through E j 115 1 In sec. VI, he added a few more definitions and 
then stated and proved six theorems about Hilbert space, labeled 1 through 6 
(ibid., pp. 18-22). In sec. VII, he turned to the discussion of operators acting 
in Hilbert space (ibid., pp. 25). This will be familiar terrain for the modern 
reader and need not be surveyed in any more detail. 

The same goes for sec. VIII, in which von Neumann introduced a special class 
of Hermitian operators. Their defining property is that they are idempotent: 
E 2 = E. Von Neumann c a lled an operat or lik e this an Einzeloperator or E.Op. 



for short (jvon Neumannl . Il927al . p. 25) J 116 l They are now kn own as projec- 
tion op erators. In a series of theorems, numbered 1 through 9, Ivon Neumann 



f ll927al . pp. 26-29) proved some properties of such operators. For our purposes, 
it suffices to know that they are Hermitian and idempotent. 



We do need to take a closer look at sec. IX. In this section, von Neumann 



( 11927al . pp. 29 -33) used proje c tion op erators to formulate the spectral theo- 
rem. Following Ivon Neumannl ( 11927al p. 31), we start by considering a finite 
Hermitian operator A with a non-degenerate discrete spectrum. Order its real 
eigenvalues: a\ < a 2 < a 3 . . . Let |aj) be the associa ted n ormalized eigenvec- 



tors ((a,i\aj) = 5ij). Now introduce the operator E(l)[ 



117 



= \ a i)( a i 



(144) 



(i\ai<l) 



where, unlike von Neumann, we used modern Dirac notation. As we already 
noted in Section 1.3, there is no phase ambiguity in E{1). The operator stays 



the same if we replace \ai) by \ai)' 



'\ai)(ai 



\a>i)(ai 



(145) 



115 [n his book, Ivon Neumann! ( 19321 ) adopted the notation 'H. R.' (shorthand for 
Hilbertscher Raum) for £>. 

116 As he exp lains in a footnote, the term Einzeloperator is based on Hilbert's term 
Einzelfo r m (yon Neumannl . 1927a . p. 25, note 23). For historical discussion, see 
Jammerl |l96d, PP- 317-318). 

117 Von Neumann initially denned this operator in terms of its matrix elements 
(v\E(l)\w) for two arbitrary sequences {vi}f =1 and {u>i}f =1 (where we replaced 
von Neumann's x and y by v a nd w; cf. note 1 10). H e defined (in our notation): 
E(l;x\y) = J2(i\ ai <i)( v \ a i}( a i\ w ) (jvon Neumannl . Il927al . p. 31). 
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Of course, von Neumann did not think of an E. Op. as constructed out of bras 
and kets, just as Jordan did not think of a probability amplitude (a\b) as an 
inner product of \a) and \b). 



The operator E(l) has the property: 

E( ai ) - E{a^ x ) 

It follows that: 



\a>i)(a>i 



A = ^ai(E((Xi) - E{a i - 1 )) = ^a^a^a-i 



(146) 



(147) 



E(l) is piece-wise constant with jumps where / equals an eigenvalue. Hence 
we can write A as a so-called Stieltjes integral, which von Neu mann discussed 
and ill ustrated with some figures in appendix 3 of his paper (jvon Neumann! . 
1927ai pp. 55-57): 

A = I ldE{l). (148) 



As Ivon Neumann! (jl927al p. 32) noted, these results (Eqs. (144)-(148)) can 
easily be generalized from finite Hermitian matrices and finite sequences to 
bounded Hermitian operators and the space £>o or I 2 of infinite square-summable 
sequences. Since #o is just a particular instantiation of the abstract Hilbert 
space it is clear that the same results hold for bounded Her mitian opera- 
tors T in After listing the key properties of E{1) for T l 118 1 he concluded 



sec. IX writing: "We call E(l) the reso lution of unity [Zerlegung der Einheit] 
belonging to T" (jvon Neumann! . Il927al p. 33). 



In sec. X, Ivon Neumann! (11927a! . pp. 33-37) further discussed the spectral 
theorem. Most importantly, he conceded that h e ha d not yet been able to 



prove that it also holds for unbounded operators ! I He only published the 



proof of this generalization in a paper in Mathematische Annalen submitted on 



118 As before (see note 117), he first defined the matrix elements (f\E(l)\g) for two 
arbitrary elements / and g of Hilbert space. So he started from the relation 



</m<?> 



ld{f\E(l)\g), 



and inferred from that, first, that T\g) = I d{E(l)\g)}, and, finally, that T = 
f^ldEil) (cf. Eq. (148)). Instead of the notation (/| a). Ivon Neumann! (|l927al . p. 



12) used the notation Q(f,g) for the inner product of / and g (on p. 32, he also 
used Q(f\g))- So, in von Neumann's own notation, the relation he started from is 
written as Q(f,Tg) = f^l dQ{f, E{l)g) (jvon Neumann! . Il927al . p. 33). 
119 We remind the reader that a linear operator A in Hilbert space is bounded if 
there exists a positive real constant C such that \Af \ < C\f \ for arbitrary vectors 
/ in the space (where |...| indicates the norm of a vector, as induced from the 
defining inner-product in the space). If this is not the case, then there exist vectors 
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December 15, 1928 ( Ivon Neumann! Il929i ) j 120 1 The key to the extension of the 



spectral theore m from bounded to un bounded operators is a so-called Cayley 
transformation ( Ivon Neumann! . Il929l p. 80). Given an unbounded Hermitian 
operator R, introduce the operator U and its adjoint 



U 



R + il 
R-iV 



ft 



R-il 
R + il 



(149) 



where 1 is the unit operator. Since R is Hermitian, it only has real eigenvalues, 
so (R — il)\ip) 7^ for any £ Since U is unitary (UW = 1), the absolute 
value of all its eigenvalues equals 1. U is thus a bounded operator for which 
the spectral theorem holds. If it holds for U, however, it must also hold for the 
original unbounded operator R. T he spectral decomposi tion of R is essentially 
the same as that of U . In his book. Ivon Neumann! (119321 . p. 80) gave Eq. (149), 
but he referred to his 1929 paper for a mathe matically rigorous trea tment of 
the spectral theorem for unbounded operators ( von Neumann! . 1932 . p. 75, p. 
246, note 95, and p. 244, note 78) 

Sec. XI concludes the purely mathematical part of the paper. In this section, 
von Neumann! (Il927al . pp. 37-41) introduced the "absolute value" of an opera- 
tor, an important ingredient, as we will see, in his derivation of his formula for 
conditional probabilities in quantum mechanics (see Eqs. (157)-(161) below). 



In sec. XII, Ivon Neumann! (Il927al . pp. 42-45) finally turned to the statistical 
interpretation of quantum mechanics. At the end of sec. I, he had already 
warned the reader that sees. II-XI would have a "preparatory character" and 
that he would only get to the real subject matter of the paper in sees. XII-XIV. 
At the beginning of sec. XII, the first section of the sixth part of the paper 
(see our table of contents above), on the statistical interpretation of quantum 
mechanics, he wrote: "We are now in a position to take up our real task, the 
mathematically unob j e ct ionable unification of statistical quantum mechanics" 
(Ivon Neumann! . Il927al . p. 42). He then proceeded to use the spectral theorem 
and the projection operators E{1) of sec. IX to construct an alternative to 
Jordan's formula for conditional probabilities in quantum mechanics, which 
does not involve probability amplitudes. Recall von Neumann's objections to 
probability amplitudes (see Sections 1 and 4). First, Jordan's basic amplitudes, 



in the Hilbert space on which the operator A is not well-defined, basically because 
the resultant vector has infinite norm. Instead, such unbounded operators are only 
defined (i.e., yield finite-norm vectors) on a proper subset of the Hilbert space, 
called the domain T>(A) of the operator A. The set of vectors obtained by applying 
A to all elements of its domain is called the range 1Z(A) of A. Multiplication of two 
unbounded operators evidently becomes a delicate matter insofar as the domains 
and ranges of the respective operators may n ot coincide 



i20p Qr fo r j e f discussions, see Ijammerl ( 1966 . p. 320) and iMehra and Rechenberg 
d2000-200ll . p. 415). 
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pip-i l) = e lpq ( see Eq. (4)), which from the perspective of Schrddinger wave 
mechanics are eigenfuncti ons of momentum, ar e not square-integrable and 



hence not in Hilbert space (Ivon Neumann!. 1 1927al. p. 35). S econd, they are only 
determined up to a phase factor (jvon Neumann! . Il927al . p. 3, point Von 
Neumann avoided these two problems by deriving an alternative formula which 
expresses the conditional probability Pr(a|6) in terms of projection operators 
associated with the spectral decomposition of the operators for the observables 
a and b. 



Von Neumann took over Jordan's basic statistical Ansatz. Consider a one- 
particle system in one dimension with coordinate q. Von Neumann (1927a, p. 
43) considered the more general case with coordinates q = (gi, . . . ,qk)- The 
probability of finding a particle in some region K if we know that its energy 
is E n (i.e., if we know the part icle i s in the pure state ip n (x) belonging to that 
eigenvalue), is given by (ibid.)j^D 



Pr(g in K\E n ) = J K \4j n {q)\ 2 dq. 



(150) 



Next, he considered the probability of finding the particle in some region K if 
we know that its energy is in some interval / that includes various eigenvalues 
of its energy, i.e., if the particle is in some mixed state where we only know 
that, with equal probability, its state is on e of t he pure states ^ n (x) associated 
with the eigenvalues within the interval / 
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Pr(g in K\E n in J) 



E 

{n\E n ml) 



K 



\Mv)\ 2 dq. 



(151) 



The distinction between pure states (in Eq. (150)) and mixed states (in Eq. 
(151)) slipped in here was o nly made explicit in the second paper in the 
trilogy (jvon Neumann! . 1 1927b! ) . These conditional probabilities can be written 
in terms of the projection operators, 



(n\E n in I) 



K 



\q)(q\ dq, 



(152) 



that project arbitrary state vectors onto the subspaces of spanned by 'eigen- 
vectors' of the Hamiltonian H and of the position operator q with eigenvalues 
in the ranges / and K, respectively. The right-hand side of Eq. (151) can be 



121 The left-hand side is short-hand for: Pr(g has value q in K\H has value E n ). We 
remind the reader that the notation Pr(. | .) is ours and is not used in any of our 
sources. 

122 The left-hand side is short-hand for: Pr(q has value q in K\H has value E n in /). 
On the right-hand side, Yl( n \E n mi) ^ s ^ ne sum over & U n such that E n lies in the 
interval /. 
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rewritten as: 

E j K (i>n\q)(q\^n)dq. (153) 

(n\E n in I) 

We now choose an arbitrary orthonormal discrete basis {|a)}^L 1 of the Hilbert 
space £>. Inserting the corresponding resolution of unity, 1 = J2a \ a )( a \i m t° 
Eq. (153), we find 

EE/ ma){a\q){q\^ n )dq. (154) 
This can be rewritten as: 



EH ( / I?X?I4Z • E hAn>(^n|] |a>. 
\ * (n|£„mJ) / 



(155) 



This is nothing but the trace of the product of the projection operators F(K) 
and E(I) defined in Eq. (152). The conditional probability in Eq. (151) can 
thus be written as: 

Pr(x in K\E n in I) = J2(a\F(K)E(I)\a) = Tr(F(K)E(I)). (156) 



This is our notation for what Ivon Neumann! (jl927al . p. 45) wrote ai 



12.3 



(157) 



He defined the quantity [A, B] — not to be confused with a commutator — as 
(ibid., p. 40): 

[A,B] = [A*B]. (158) 

For any operator O, he defined th e qu antity [O], which he called the "absolute 
value" of O, as (ibid., pp. 37-38) 



121 



2 

vl | ) 



(159) 



123 S'mce von Neumann (ibid., p. 43) chose K to be fc-dimensional, he actually wrote: 
[-Fi(Ji) • . . . • Fk(Jk),E(I)\ (ibid., p. 45; hats added). For the one-dimensional case 
we are considering, von Neumann's expression reduces to Eq. ( 157) . For other dis- 



formula, see iJammerl (|1966l . pp. 



cussions of v on Neumann's derivation of this ke; 
320-321) and lMehra and Rechenbergl (j2000-200li . p. 414). 
124 Using th e notation Q(. , .) for t he inner product (see note 118) and using A in- 
stead of O. Ivon Neumann! ()1927al . p. 37) wrote the right-hand side of Eq. (159) as 

E~=ilQ(^,^)l 2 - 



88 



where and are two arbitrary orthonormal bases of Eq. 

(159) can also be written as: 

[6] = EW^XiM^W = = (ieo) 

where we used the resolution of unity, 1 = IVvX^i/I, and the fact that 
Tr(O) = J2 a ( a \0\ a ) f° r any orthonormal basis {|q;)}^L 1 of £> \ 125 1 Using the 
definitions of [A,B] and [O] in Eqs. (158) and (160), with A = F(K), B = 
E(I), and 6 = F{K)^E{J), we can rewrite Eq. (157) as 

[F, E) = [F ] E] = Tr((F t E)(F t E) t ) = Tr(F*E&P), (161) 

where, to make the equation easier to read, we temporarily suppressed the 
value ranges K and I of F(K) and E(I). Using the cyclic property of the 
trace, we can rewrite the final expression in Eq. (161) as Tr(FF'EE^). Since 
projection operators P are both Hermitian and idempotent, we have FF' = 
F 2 = F and EE' = E 2 = E. Combining these observations and restoring the 
arguments of F and E, we can rewrite Eq. (161) as: 

[F(K),E(I)]=Tr(F(K)E(I)), (162) 

which is just Eq. (156) for Pr(x in K\E n in I) found above. 

From Tt(FE) = Tr(EF), it follows that 

Pr(x in K\E n in I) = Pr(E n in I\x in K), (163) 

which is just the symmetry property impos ed on Jordan's probability ampli- 



tudes in postulate B of Neue Begriindung I fl Jordan. 1 192 7bl. p . 813; see Section 
2.1) and postulate II in Neue Begriindung II ([Jordan! Il927gl . p. 6; see Section 
4). 

Von Neumann generalized Eq. (156) for a pair of quantities to a similar for- 
mula for a pair of sets of quantities such that the operators for all quanti- 
ties in each set commute with those for all other quantities i n that same set 



but no t neces sarily with those for quantities in the other set (jvon Neumann 
1927ai p. 45)13 Let {Ri}™ =1 and {Sj}f =1 be two such sets of commut- 



ing operators: [Ri 1 ,Ri 2 ] = for all 1 < i±,i2 < n; [Sj-^.S^] = for all 



125 Eq. (160) shows that [O] is independent of the choice of the bases {|^)}^Li 
and {|V^)}^Li- Von Neumann (1927a, p. 37) initially introduced the quantity 
[O; ip^;ip u ] = Ylfj, v K^/JO | | 2 . He then showed that this quantity does not ac- 
tually depend on ip^ and ip u , renamed it [O] (see Eq. (159) and note 124), and 
called it the "absolute value of the operator" O (ibid., p. 38). 

126 Von Neumann distinguished between the commuting of R{ and Rj and the com- 
muting of the corresponding projection operators Ei(Ii) and EAlj). For bounded 
operators, these two properties are equivalent. If both Ri and Rj are unbounded, 
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1 < jitj2 — m > 127 1 Let Ei(Ii) (i = 1, . . . , n) be the projection operators onto 
the space spanned by eigenstates of Ri with eigenvalues in the interval Jj and 
let Fj(Jj) (j = 1, . . . ,m) likewise be the projection operators onto the space 
spanned by eigenstates of Sj with eigenvalues in the interval Jj (cf. Eq. (152)). 
A straightforward generalization of von Neumann's trace formula (156) gives 
the probability that the S^-'s have values in the intervals Jj given that the i?j's 
have values in the intervals 



Pr(S/s in J/s\R/s in J/s) = Tr (E^h) . . . £ n ,(/ n ,)A( Ji) • ■■F m {J rn )). (164) 

The outcomes in Jj' are called the "assertions" ( Behauptun qen) and the 
outcom es l Sj in Jj' are called the "conditions" (Voraussetzungen) (jvon Neumann! . 
1927al . p. 45). Because of the cyclic property of the trace, which we already 
invoked in Eq. (163), Eq. (164) is invariant under swapping all assertions with 
all conditions. Since all ^(J^'s commute with each other and all Fj( Jj)'s com- 
mute with each other, Eq. (164) is also invariant under changing the order of 
the assertions and changing the order of the conditions. These two properties 
are given in the first two entries of a list of five properties, labeled a through 
9 (there are no poin ts C and 77), of the ba sic rule (164) for probabilities in 
quantum mechanics (jvon Neumann! . 1 1927a] . pp. 45-47). 



Under point 7, von Neumann noted that projection operators F( J) and E(I) 
for "empty" (nichtssagende) assertions and conditions, i.e., those for which 
the intervals J and / are (—00, +00), can be added to or removed from Eq. 
(164) without affecting the result. 

Under point 5, von Neumann, following Jordan, noted that the standard mul- 
tiplication law of probabilities does not hold in quantum mechanics. Paren- 
thetically, he added, "(what does hold is a weaker law corresponding to the 
"superposition [Zusammensetzung] of probability a mplitudes" in [the form al- 
ism of] Jordan, which we will not go into here)" ( von Neumann! . 1927a . p. 
46). Note that von Neuman n did not use Jordan's (1927b, p. 812) phrase 
"interference of probabilities." 
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Under point e, Ivon Neumann! (Il927af) wrote: "The addition rule of probabili- 
ties is valid" (p. 46). In general, as I Jordan! (!l927bl p. 18) made clear in Neue 
Begriindung I (see Eq. (1) in Section 1), the addition rule does not hold in 
quantum mechanics. In general, in other words, Yi(A or B) 7^ Pr(A) + Pr(5), 



von Neumann! (jl927al . p. 45) cautioned, "certain difficulties of a formal nature occur, 
which we do not want to go into here" (cf. note 119). 

127 Qnlike von Neumann (see Eqs. (157)-(158)), we continue to use the notation [., .] 
for commutators. 

128 After reading Heisenberg's critic ism of this aspect of Jordan's theory in the un- 
certainty paper (!Heisenberd.ll927bl. pp. 1 83-184. p. 196; cf. note 38), von Neumann 
changed his mind (jvon Neumann! . Il927bl . p. 246; cf. note 133). 
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even if the outcomes A and B are mutually exclusive. Instead, Jordan pointed 
out, the addition rule, like the multiplication rule, holds for the corresponding 
probability amplitudes. Von Neumann, however, considered only a rather spe- 
cial case, for which the addition rule does hold for the probability themselves. 
Consider Eq. (156) for the conditional probability that we find a particle in 
some region K given that its energy E has a value in some interval /. Let the 
region K consist of two disjoint subregions, K' and K", such that K = K'UK" 
and K'P\K" = 0. Given that the energy E lies in the interval /, the probability 
that the particle is either in K' or in K", is obviously equal to the probability 
that it is in K. Von Neumann now noted that 

Pr(x in K\E in /) = Pr(x in K'\E in J) + Pr(x in K"\E in I). (165) 

In terms of the trace formula (156), Eq. (165) becomes: 

Tr(F(K)E(I)) = Ti(F(K')E(I)) + Tr(F(K")E(I)). (166) 

Similar instances of the addition rule obtain for the more general version of 
the trace formula in Eq. (164). 

Under point finally, we find the one and only reference to "canonical trans- 
formations" in Mathematische Begriindung. Von Neumann (1927a, pp. 46- 
47) defined a canonical transformation as the process of subjecting all opera- 
tors A to the transformation UAW, where U is some unitary operator. The 
absolute value squared [A] is invariant under such transformations. Recall 
[A] = Tr(iit) (see Eq. (160)). Now consider [UAU*]: 

[UAU ] ] = Tr(uAfr(UAfry) = Tt{UAU ] UA ] U ] ) = Tr(ii f ) = [A]. (167) 

Traces of products of operators are similarly invariant. This definition of 
canonical transformations makes no reference in any way to sorting quantities 
into sets of conjugate variables. 



Von Neumann's Wahrscheinlichkeitstheoretischer Aufbau (November 
1927) 



On November 11, 1927 , about half a year aft er the first installment, Mathe- 



matische Begriindung (jvon Neumann! . Il927al ). the second and the third in- 



stallments of von Neumann's 19 2 7 tril ogy were presented to the Gottingen 
Academy ( von Neumann . 1927bl lcl) ] 129 The second, Wahrscheinlichkeitstheo- 



retischer Aufbau, is important for our purposes; the third, Thermodynamik 



129 These three papers take up 57, 28, and 19 pages. The first installment is thus 
longer than the other two combined. Note that in between the first and the two 



later installments, Neue Begriindung II appeared (see Section 4), in which I Jordan 
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quantenmechanischer Gesamtheiten, is not J 130 1 In Mathematische Begriindung, 
as we saw in Section 5, von Neumann had simply taken over the basic rule for 
probabilities in quantum mechanics as stated by Jordan, namely that prob- 
abilities are given by the absolute square of the corresponding probability 
amplitudes, the prescription now known as the Born rule. In Wahrschein- 
lichkeitstheoretischer Aufbau, he sought to derive this rule from more basic 
considerations. 



In the introduction of the paper, Ivon Neumann! (jl927bl . p. 245) replaced the 
old opposition between "wave mechanics" and "matrix mechanics" by a new 
distinction between "wave mechanics" on the one hand and what he called 
"transformation theory" or "statistical theory," on the other. By this time, 
matrix mechanics and Dirac's g-number theory had morphed into the Dirac- 
Jordan statistical transformation theory. The two names von Neumann used 
for this theory reflect the difference in e mpha sis between Dirac (transformation 
theory) and Jordan (statistical theory) J 131 1 Von Neumann mentioned Born, 
Pauli, and London as the ones who had paved the way for the statistical theory 
and Dirac and Jordan as the ones respons ible for bringing this development 
to a conclusion (ibid., p. 245; cf. note 108) 
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Von Neumann was dissatisfied with the way in which probabilities were in- 
troduced in the Dirac- Jordan theory. He listed two objections. First, he felt 
that the relation between quantum probability concepts and ordinary proba- 
bility theory needed to be clarified. Second, he felt that the Born rule was not 
well-motivated: 



( 1927gl ) responded to von Neumann's criticism in Mathematische Begriindung. Von 

Neumann made no comment on this resp onse in these two later p a pers. 

i30p or discussion of this third paper, see iMehra and Rechenbera 1)2000-200 ll . pp. 
439-445). See pp. 431-436 for their discussion of the second paper. 
131 In Section 1.3, we already quoted his observation about the Schrodinger wave 
function, (q\E) in our notation: "Dirac interprets it as a row of a certain transf or- 
mation matrix, Jordan calls it a probability amplitude" (jvon Neumann! . Il927bl . p. 
246, note 3). 

132 He cited the relevant work by Dirac ( 19271 ) and Jordan ( 1927b g). He di d not 
give re feren ces for the ot her t hree authors but presumably was thinking of IBorn 
(ll926al lblkl). |Paulil (jl927ah . and iLondonl (jl926bh . The reference to London is some- 
what puzzling. While it is true that London anticipated important aspects of the 
Dirac- Jordan transformation theory (see Lacki, 2004; Duncan and Janssen, 2009), 
the statistical interpretation of the formalism is not among those. Our best guess 
is that von Neumann took note of Jordan's repeated acknowledgment of London's 
paper (most p r omine ntly perhaps in footnote 1 of Neue Begriindung I). In his book. 



von Neumann! (11932. p. 2, not e 2; the note itse lf is on p. 238) cited papers by 



1927 L I Jordan! (jl927bh . and ILondonl (jl926bh in addition to the book by 



Dirac 



Dirac 



1930) for the development of transformation theory. In that context, the reference 



to London is entirely appropriate. 
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The method hitherto used in statistical quantum mechanics was essentially 
deductive: the square of the norm of certain expansion coefficients of the 
wave function or of the wave function itself was fairly dogmatically set equal 
to a probability, and agreement with experience was verified afterwards. A 
systematic derivation of quantum mechanics from empirical facts or fun- 
damental probability-theoretic assumptions, i.e., an inductive justification, 
was not given. Moreover, the relation to the ordinary probability calculus 
was not sufficiently clarified: the validity of its basic rules (addition and 
multiplication law of t he probability calculus ) wa s not sufficiently stressed 
fjvon Neumann! . Il927bl . p. 246; our emphasis) 
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To address these concerns, von Neumann started by introducing prob abilities 
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He then 

sec 



in terms of selecting members from a large ensemble of systems 
presented his "inductive" derivation of his trace formula for probabilities 
Eq. (156)), which contains the Born rule as a special case, from two very 
general, deceptively innocuous, but certainly non-trivial assumptions about 
expectation values of properties of the systems in such ensembles. From those 
two assumptions, some key elements of the Hilbert space formalism introduced 
in Mathematische Begrundung, and two assumptions about the repeatability 
of measurements not explicitly identified until the summary at the end of the 
paper (ibid., p. 271), Von Neumann indeed managed to recover Eq. (156) for 
probabilities. He downplayed his reliance on the formalism of Mathematische 
Begrundung by characterizing the assumptions taken from it as "not very far 
going formal and material assumptions" (ibid., p. 246). He referred to sec. IX, 



133 The last se ntence is in response to Heisenberg's criticism in his uncertainty paper 
(|Heisenberei . Il927bl . pp. 183-184; cf. note 38) of Jordan's concept of "interference 
of probabilities" defined in Neue Begrundung I as "the circumstance that not the 
probabilities themselv es but their amp litudes obey the usual composition law of the 
probability c alculus" (IJordanl. Il927bl, p. 812). Earlie r von Neumann had endorsed 
this concept: 



Hilbert. von Neumann, and Nordheim ( 19281 . p. 5; cf. note 93) and 



von Neumann! (|1927al. p. 46: c f. not e 128). 



i34j n n i s book, von Neumann! (jl932l . p. 158, p. 255, note 156) referred to a book by 
Richard Ivon Misesl (|l928l ) for this notion of an ensemble ( Gesamtheit or Kollektiv) 
( Lacki . 2000l . p. 308). Von Neumann may have picked up this notion from von Mises 
in the period leading up to his Habilitation in mathematics in Berlin in Decem ber 
1927. Vo n Mises was one of his examiners (jMehra and Rechenbera . 12000— 200ll . p. 



402). As Ivon Misesl (119281 ) explained in its preface, his book elaborates on ideas he 
had been presenting for "about fifteen years" in various talks, courses, and articles. 
Von Mises defined a collective {Kollektiv) as an ensemble {Gesamtheit) whose mem- 
bers are distinguished by some observable marker {beobachtbares merkmal). One of 
his examples is a group of plants of the same species grown from a given collec- 
tion of seeds, where the individual plants differ from one another in the color of 
their flowers (ibid., pp. 12-13 ). In their discuss i on of von Neumann's Wahrschein- 
lichke itstheoretischer Aufbau, Born and Jordan ( 1930l . p. 306) also cited von Misesl 
(119281 ) 
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the summary of the paper, for these assumptions at this point, but most of 
them are already stated, more explicitly in fact, in sec. II, "basic assumptions" 
(ibid., pp. 249-252). 

Consider an ensemble {©i, ©2> ©3 ; • • •} of copies of a system @. Von Neumann 
wanted to find an expression for the expectation value 8(a) in that ensemble of 
some property a of the system (we use 8 to distinguish the expectation value 
from the projection operator E ). He made the following basic assumptions 
about 8 flvon Neumann! . Il927bl . pp. 249-250): 



A. Linearity: 8(aa + /36 + 7C + .) = a 8(a) + (38(b) + 7<£(c) + . . . (where a, 
(3, and 7 are real numbers) 



135 



B. Positive-definiteness. If the quantity a never takes on negative values, then 
8(a) > 0. 

To this he added two formal assumptions (ibid., p. 252): 

C. Linearity of the assignment of operators to quantities. If the operators S, 
T, . . . represent the q uanti ties a, 6, ... , then aS + (3T + . . . represents the 
quantity a a + (3 i> + . . 
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D. If the operator S represents the quantity 0, then f(S) represents the quan- 
tity f(a). 

In sec. IX, the summary of his paper, von Neumann once again listed the 
assumptions that go into his derivation of the expression for 8(a). He wrote: 



The goal of the present paper was to show that quantum mechanics is not 
only compatible with the usual probability calculus, but that, if it [i.e., 
ordinary probability theory] — along with a few plausible factual [sachlich] 
assumptions — is taken as given, it [i.e., quantum mechanics] is actually the 
only possible solution. The assumptions made were the following: 
1. Every measurement changes the measured object, and two measurements 



i35jj ere von N eumann appended a footnote in which he looked at the example of a 
harmonic oscillator in three dimensions. The same point can be made with a one- 
dimensional harmonic oscillator with position and momentum operators q and p, 
Hamiltonian H, mass m, and characteristic angular frequency u: "The three quan- 
tities \p/2m, muj 2 q 2 /2, H = p/2m + mw 2 g 2 /2] have very different spectra: the first 
two both have a continuous spectrum, the third has a discrete spectrum. Moreover, 
no two of them can be measured simultaneously. Nevertheless, the sum of the ex- 
pectation values of the first two equals the expectation value of the third" (ibid., p. 
249). While it may be reasonable to impose condition (A) on directly measurable 
quantities, it is questionable whether this is also reasonable for hidden variables (see 
note 140). 

i36j n VQn N eumann ' s own notation, the operator S and the matrix S representing 
that operator are both written simply as S. 
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therefore always disturb each other — except when they can be replaced 
by a single measurement. 

2. However, the change caused by a measurement is such that the measure- 
ment itself retains its validity, i.e., if one repeats it immediately after- 
wards, one finds the same result. 

In addition, a formal assumption: 

3. Physical quantities are to be described by functional operators in a man- 
ner subject to a few simple formal rules. 

Thes e principles already in evitably entail quantum mechanics and its statis- 



tics (jvon Neumann! . Il927bl . p. 271). 



Assumptions A and B of sec. II are not on this new list in sec. IX. Presum- 
ably, this is because they are part of ordinary probability theory. Conversely, 
assumptions 1 and 2 of sec. IX are not among the assumptions A-D of sec. II. 
These two properties of measurements, as we will see below, are guaranteed 
in von Neumann's formalism by the i dem potency of the projection operators 



associated with those measurements ] I Finally, the "simple formal rules" 



referred to in assumption 3 are spelled out in assumptions C-D. 

We go over the main steps of von Neumann's derivation of his trace formula 
from these assumptions. Instead of the general Hilbert space |>, von Neumann 
considered $>o, i.e., I 2 (von Neumann, 1927b, p. 253; cf. von Neumann, 1927a, 
pp. 14-15 [see Section 5]). Consider some infinite-dimensional Hermitian ma- 
trix S, with matrix elements = s*^, representing an Hermitian operator S. 
This operator, in turn, represents some measurable quantity a. The matrix S 
can be written as a linear combination of three types of infinite-dimensional 
matrices labeled A, B, and C. To show what these matrices look like, we write 



137 Assumption 2 is first introduced in footnote 30 on p. 262 of von Neumann's 
(1927b) paper: "Although a measurement is fundamentally an intervention (Ein- 
griff), i.e., it changes the system under investigation (this is what the "acausal" 
character of quantum mechanics is based on, cf. [Heisenberg, 1927b, on the un- 
certainty principle]), it can be assumed that the change occurs for the sake of the 
experiment, i.e., that as soon as the experiment has been carried out the system is in 
a state in which the same measurement can be carried out without further change 
to the system. Or: that if the same measurement is performed twice (and noth- 
ing happens in between), the result is the same." Von Neumann (1927c) reiterated 
assumptions 1 and 2 in the introduction of Thermodynamik quantenmechanischer 
Gesamtheiten and commented (once again citing Heisenberg's uncertainty paper): 
"1. corresponds to the explanation given by Heisenberg for the a-causal behavior of 
quantum physics; 2. expresses that the theory nonetheless gives the appearance of 
a kind of causality" (p. 273, note 2). 
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down their finite-dimensional counterparts: 
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The A^s have 1 in the /x th row and the /i th column and O's everywhere else. 
The B^s (/i < u) have 1 in the /i th row and the is th column and in the u th 
row and the /z th column and O's everywhere else. The C^'s (fi < v) have i in 
the /i th row and the u th column and —% in the u th row and the /i th column and 
O's everywhere else. 



The A^s have eigenvectors (eigenvalue 1) with 1 in the /i row and O's ev- 
erywhere else; and infinitely many eigenvectors with eigenvalue 0. The B^s 
have eigenvectors (eigenvalue 1) with l's in the fi th and the u th row and O's 
everywhere else; eigenvectors (eigenvalue —1) with 1 in the /i th row, —1 in 
the z/ th row, and O's everywhere else; and infinitely many eigenvectors with 
eigenvalue 0. The C^'s have eigenvectors (eigenvalue 1) with i — 1 in the /i th 
row, i + l in the z/ th row, and O's everywhere else; eigenvectors (eigenvalue —1) 
with — 1 — i in the /i th row, 1 — i in the u th row, and O's everywhere else; and 
infinitely many eigenvectors with eigenvalue 0. 

For the counterpart of B^ v in a simple finite case (with fi, v = 1, 2, 3), we have: 
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For the counterpart of C^ v we similarly have 
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The matrix S can be written as a linear combination of A, B, and C: 

S = ^ s w • A^ + Re ■ 5^ + ^ Im • C^, (168) 



^1 /x<^ /i<v 

where Re and Im s^,, and the real and imaginary parts of s^ u , respectively. 
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Using von Neumann's linearity assumption (A), we can write the expectation 
value of S in the ensemble {©i, ©2, ©3? • • •} as: 

= E s ^ ■ + E Re s ^ ■ £ ( B ^) + E Im s ^ ■ ( 169 ) 

Since the eigenvalues of A^, B^ u , and C^ v are all real, the expectation values 
S(A )J ), EiB^y), and £{C^ U ) are also real. Now define the matrix U (asso- 
ciated with some operator U) with diagonal components u MM = S(A fl ) and 
off-diagonal components (// < v): 

= l - (S(B^) +i£(C^)) , u vtl = l - {£(B^) -%£{C,J)) . (170) 

Note that this matrix is Hermitian: u*^ u = u Ufl .. With the help of this matrix U, 
the expectation value of S can be written as (jvon Neumann! . Il927bl . p. 253): 

= E s iiv u vn- ( 171 ) 

To verify this, we consider the sums over \i — v and /i^y separately. For the 
former we find 

E s w u w = E s^-SiAf,). (172) 

For the latter, we have 

^ ] SjU!/ ^l/p ^ ] Ul//Lt ^ ] u u ^. (173) 
fj^v H<v \x>v 

The second term can be written as X)„ >M = X)^< v s % which means 

that 

E s l^ u vn = E 2Re ( S Mf W ^)- ( 174 ) 

Now write s MI , as the sum of its real and imaginary parts and use Eq. (170) 
for u Vfl : 



E •V u ^ = E Re{(Res M! , + zlms^) ■ {£{B^ V ) - iS^C^))} 

= £ Re^ • £(B IJLV ) + ^ Im V • ^(C^). (175) 

Adding Eq. (172) and Eq. (175), we arrive at 
E "V M ^ = E S W + E Re "V ' + E Im, V ' £ ( c ^)- ( 176 ) 

Eq. (169) tells us that the right-hand side of this equation is just £(S). This 
concludes the proof of Eq. (171), in which one readily recognizes the trace of 
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the product of S and U 



138 



£{S) = £ = £(s^W = Tr (^)- ( 177 ) 



fJ,V 



In other words, {7 is what is now called a density matrix, usually denoted by 
the Greek letter p. It corresponds to a density operator U or p. 

The matrix U characterizes the ensemble @ 2; ©3; • • •}• Von Neumann 
(1927b, sec. IV, p. 255) now focused on "pure" (rein) or "uniform" (ein- 
heitlich) ensembles, in which every copy of the system is in the exact same 
state. Von Neumann characterized such ensembles as follows: one cannot ob- 
tain a uniform ensemble "by mixing (vermischen) two ensembles unless it 
is the case that both of these correspond to that same ensemble" (ibid., p. 
256). Let the density operators U, U*, and U** correspond to the ensembles 
{©i}, {©*}, {©£*}, respectively. Suppose {©;} consists of rj x 100% {©*} and 
$ x 100% The expectation value of an arbitrary property represented 

by the operator S in is then given by (ibid.): 

S(S) = v£*(S) + #£**(S), (178) 

where S* and E** refer to ensemble averages over {©*} and {©"}, respectively. 
Using Eq. (177), we can write this as: 

Tr (US) = r]Tr(U*S) + $Tr(U**S). (179) 

Since S is arbitrary, it follows that U, U*, and U** satisfy (ibid.): 

U = 7]U* + $U**. (180) 

Von Neumann now proved a theorem pertaining to uniform ensembles (ibid., 
pp. 257-258). That U is the density operator for a uniform ensemble can 
be expressed by the following conditional statement: If (U = U* + U**) then 
(U* oc U** oc U). Von Neumann showed that this is equivalent to the statement 
that there is a unit vector \tp) suc h that U is the projection operator onto that 



vector, i.e., U = P Lp = |cg)(cg| j I Written more compactly, the theorem says: 



{ ((J = (J* + U**) (U* oc U** oc U) } & { 3 \<p), U = P V = \<p){<p\ }. (181) 

The crucial input for the proof of the theorem is the inner-product structure 
of Hilbert space. The theorem implies two important results, which, given the 



138 Von Neumann (1927b, p. 255) only wrote down the first step of Eq. (177). It 



was only in the third installment of his trilogy, that Ivon Neumann! (|1927ci . p. 274) 
finally introduced the notation trace (Spur), which we used here and in Eq. (156) 
in Section 5. For another discussion of the derivation of the key formula (177), see 
Mehra and Rechenberd (|2000-200ll . pp. 433-434). 



139 The notation (except for the hat) is von Neumann's own (ibid., p. 257). 
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generality of the assumptions going into its proof, have the unmistakable flavor 
of a free lunch. First, pure di sper sion-free states (or ensembles) correspond to 



unit vectors in Hilbert space ] 140 1 Second, the expectation value of a quantity 
a represented by the operator S in a uniform ensemble {©i} characterized by 
the density operator U = \<p)(<p\ is given by the trace of the product of the 
corresponding matrices: 

S(S) = Tr(US) = Tr(\tp)&\S) = (<p\S\<p), (182) 

which is equivalent to the Born rule. 

Von Neumann was still not satisfied. In sec. V, "Measurements and states," 
he noted that 

our knowledge of a system ©', i.e., of the structure of a statistical ensemble 
{©i,®2> • • •}> i s never described by the specification of a state — or even 
by the corresponding ip [i.e., the vector \(p)]; but usua l ly by t he result of 



measurements performed on the system (jvon Neumann! . Il927bl . p. 260). 



He considered the simultaneous measurement of a complete set of commuting 
operators and constructed a density operator for (the ensemble represent- 
ing) the system on the basis of outcomes of these measurements showing the 
measured quantities to have values in certain intervals. He showed that these 
measurements can fully determine the state and that the density operator in 
that case is the projection operator onto that state. 

Let {Sfj} (/x = l,...,m) be a complete set of commuting operators with 
common eigenvectors, {|cr n )}, with eigenvalues A M (n): 

S>nWn) = -V( n ) I "")" ( 183 ) 

Now construct an operator S with those same eigenvectors and completely 
non-degenerate eigenvalues X n : 

S\a n ) = \ n \a n ), (184) 



U0 This is the esse nce of von Neumann's later no-hidden variables p roof 
( von Neumann . 19321 . Ch. 4, p. 171), which was criticized by John (|1966l . pp. 



1-5), who questioned the linearity assumption (A), £(act + /3&) = otB(a)+ f3 £ (6) (see 
note 135). Bell argued, with the aid of explicit examples, that the linearity of expec- 
tation values was too strong a requirement to impose on hypothetical dispersion- free 
states (dispersion- free via specification of additional "hidden" variables). In particu- 
lar, the dependence of spin expectation values on the (single) hidden variable in the 
explicit example provided by Bell is manifestly nonlinear, although the model re- 
produces exactly the standard quantum-mechanical result s when one averages (uni- 



formly ) ove r the hidde n variable. For recent discussion, see lBacciagaluppi and Crull 
(120091 ) and [EM] ifcoid ). 
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with A n 7^ \ n i if n ^ nl . Define the functions / M (A n ) = A At (n). Consider the 
action of f^(S) on \a n ): 

U{S) \(T n ) = / M (A„) \<r n ) = A M (n) |<r n ) = |a n ). (185) 

Hence, 5^ = f^(S). It follows from Eq. (185) that a me a sureme nt of S uniquely 



determines the state of the system. As lvon Neumann! (jl927bl ) put it: "In this 



way measurements have been identified that uniquely determine the state of 
[the system represented by the ensemble] (p. 264). 

As a concrete example, consider the bound states of a hydrogen atom. These 
states are uniquely determined by the values of four quantum numbers: the 
principal quantum number n, the orbital quantum number /, the magnetic 
quantum number mi, and the spin quantum number m s . These four quan- 
tum numbers specify the eigenvalues of four operators, which we may make 
dimensionless by suitable choices of units: the Hamiltonian in Rydberg units 
(H/Ky), the angular momentum squared (L 2 /h 2 ), the ^-component of the 
angular momentum (L z /h), and the ^-component of the spin (a z /h). In this 
case, in other words, 

{S^ =1 = (H/Ry, L 2 /h 2 , L z /h, a z /K). (186) 

The task now is to construct an operator 5* that is a function of the S^'s 
(which have rational numbers as eigenvalues) and that has a completely non- 
degenerate spectrum. One measurement of S then uniquely determines the 
(bound) state of the hydrogen atom. For example, choose a, (3, 7, and 5 to be 
four real numbers, incommensurable over the rationals (i.e., no linear combi- 
nation of a, (3, 7, 5 with rational coefficients vanishes), and define 

S = aSi + f3S 2 + 7 5 3 + 5S 4 . (187) 

One sees immediately that the specification of the eigenvalue of S suffices to 
uniquely identify the eigenvalues of H, L 2 , L z and a z . 

Von Neumann thus arrived at the typical statement of a problem in modern 
quantum mechanics. There is no need anymore for g's and p's, where the p's do 
not commute with the g's. Instead one identifies a complete set of commuting 
operators. Since all members of the set commute with one another, they can 
all be viewed as g's. The canonically conjugate p's do not make an appearance. 

To conclude this section, we want to draw attention to one more passage 
in Wahrscheinlichkeitstheoretischer Aufbau. Both Jordan and von Neumann 
considered conditional probabilities of the form 

Pr(a has the value a \ b has the value b), 
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or, more generally, 



Pr(a has a value in interval I \ b has a value in interval J). 

To test the quantum-mechanical predictions for these probabilities one needs 
to prepare a system in a pure state in which b has the value b or in a mixed 
state in which b has a value in the interval /, and then measure a. What 
happens after that measurement? Elaborating on Heisenberg's (1927b) ideas 
in the uncertainty paper (cf. note 137), von Neumann addressed this question 
in the concluding section of Wahrscheinlichkeitstheoretischer Aufbau: 

A system left to itself (not disturbed by any measurements) has a com- 
pletely causal time evolution [governed by the Schrodinger equation]. In 
the confrontation with experiments, however, the statistical character is 
unavoidable: for every experiment there is a state adapted [angepafit] to 
it in which the result is uniquely determined (the experiment in fact pro- 
duces such states if they were not there before); however, for every state 
there are "non-adapted" measurements, the execution of which demolishes 
\zertriim rneri\ that state and pr oduces adapted states according to stochas- 



tic laws (jvon Neumann! . Il927bl . pp. 271-272). 



As far as we know, this is the first time the infamous collapse of the state 
vector in quantum mechanics was mentioned in print. 



7 Conclusion: Never mind your p's and g's 



The postulates of Jordan's Neue Begriindung papers amount to a concise for- 
mulation of the fundamental tenets of the probabilistic in terpreta t ion of q uan- 



tum mechanics. Building on insights of Born and Pauli, iJordanl fll927bf ) was 
the first to state in full generality that probabilities in quantum mechanics are 
given by the absolute square of what he called probability amplitudes. He was 
also the first to recog nize in full generality the peculiar rules for combining 
probability amplitudes ] 141 1 However, after laying down these rules in a set of 
postulates, he did not fully succeed in constructing a satisfactory formalism 
realizing those postulates. 

As we argued in this paper, Jordan was lacking the requisite mathematical 
tools to do so, namely abstract Hilbert space and the spectral theorem for 
operators acting in Hilbert space. Instead, Jordan drew on the canonical for- 
malism of classical mechanics. Jordan was steeped in this formalism, which 
had played a central role in the transition from the old quantum theory to 



1'llT. 



In his uncertainty paper, however. iHeisenbera (|1927bl . pp. 183-184, p. 196) criti 



cized Jordan's notion of the "interference of probabilities" (see note 38). 
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matrix mechanics (jDuncan and Janssenl . 120071 ) as well as in the further de- 
velopment of the ne w theory, to which Jo r dan had ma de a number of signif- 
icant contributions (jDuncan and Janssenl . 12008k 120091 ) . Most importantly in 
view of the project Jordan pursued in Neu e Begriindung, he had published 
two papers the year before (Jordan, 1926al lbl). in which he had invest igated 



the implementation of canonical t ransformations in matrix mechanics (lLackil . 
2004 ; Duncan and Janssenl . 2009 ). As he put it in his AHQP interview (see 
Section 2.2 for the full quotation), canonically conjugate variables and canon- 
ical transformations had thus been his "daily bread" in the years leading up 
to Neue Begriindung. 



Unfortunately, as we saw in Sections 2 and 4, this formalism — the p's and 
q's of the title of our paper — proved ill-suited to the task at hand. As a re- 
sult, Jordan ran into a number of serious problems. First, it turns out to be 
crucially important for the probability interpretation of the formalism that 
only Hermitian operators be allowed. Unfortunately, canonical transforma- 
tions can turn p's and g's corresponding to Hermitian operators into new P's 
and Q's corresponding to non-Hermitian ones. Initially, Jordan hoped to make 
room in his formalism for quantities corresponding to non-Hermitian operators 
by introducing the so-calle d Erg dnzungs amplitude (see Section 2. 4 ). Eve ntu- 
ally, following the lead of iHilbert. von Neumann, and Nordheiml ( 119281 ) , he 
dropped the Ergdnzungsamplitude, which forced him to restrict the class of 
allowed canonical transformations (rather arbitrarily from the point of view 
of classical mechanics) to those preserving Hermiticity. The difficulties facing 
Jordan 's appro ach became particularly severe when, in Neue Begriindung II, 
Jordan (Il927ef ) tried to extend his formalism, originally formulated only for 
quantities with purely continuous spectra, to quantities with wholly or partly 
discrete spectra. One problem with this extension was that, whereas canonical 
transformations do not necessarily preserve Hermiticity, they do preserve the 
spectra of the p's and g's to which they are applied. Hence, there is no canon- 
ical transformation that connects the generalized coordinate q, which has a 
continuous spectrum, to, for instance, the Hamiltonian H, which, in general, 
will have at least a partly discrete spectrum. As Jordan's construction of a 
realization of his postulates hinged on the existence of a canonical transfor- 
mation connecting q and H, this presented an insurmountable obstacle. The 
newly introduced spin variables further exposed the limitations of Jordan's 
canonical formalism. To subsume these variables under his general approach, 
Jordan had to weaken his definition of canonically conjugate quantities to such 
an extent that the concept lost much of its meaning. Under Jordan's definition 
in Neue Begriindung II, any two of the three components a x , a y , and a z of 
spin angular momentum are canonically conjugate to each other. 



All these problems can be avoided if the canonical formalism of classical me- 
chanics is replaced by the Hilbert space formalism, even though other math- 
ematical challenges remain. When Jordan's probability amplitudes (p(a, b) for 



102 



the quantities a and b are equated with 'inner products' (a\b) of normalized 
'eigenvectors' of the corresponding operators a and b, the rules for such am- 
plitudes, as laid down in the postulates of Jordan's Neue Begriindung, are 
automatically satisfied. Probabilities are given by the absolute square of these 
inner products, and Jordan's addition and multiplication rules for probability 
amplitudes essentially reduce to the familiar completeness and orthogonality 
relations in Hilbert space (see Section 2.1, Eq. (3)). Once the Hilbert space 
formalism is adopted, the need to sort quantities into p's and <f s disappears. 
Canonical transformations, at least in the classical sense as understood by 
Jordan, similarly cease to be important. Instead of canonical transformations 
(p, q) — > (TpT" 1 , TqT" 1 ) of pairs of canonically conjugate quantities, one now 
considers unitary transformations A — > UAU^ 1 of individual Hermitian op- 
erators. Such transformations get us from one orthonormal basis of Hilbert 
space to another, preserving inner products as required by the probability 
interpretation of quantum theory. 



The Hilbert space formalism was introduced by Ivon Neumann! ( 11927a! ) in 
Mathematische Begriindung. However, von Neumann did not use this formal- 
ism to provide a realization of Jordan's postulates along the lines sketched in 
the preceding paragraph. As we saw in Section 5, Von Neumann had some 
fundamental objections to the approach of Jordan (and Dirac). The basic 
probability amplitude for p and q in Jordan's formalism, (p\q) = e~ tpq l h (see 
Eq. (4)), is not a square-integrable function and is thus not an element of the 
space L 2 instantiating abstract Hilbert space. The delta function, unavoidable 
in the Dirac- Jordan formalism, is no function at all. Moreover, von Neumann 
objected to the phase-ambiguity of the probability amplitudes. 



Jordan's response to this last objection illustrates the extent to which he was 
still trapped in thinking solely in terms of p's and q's. In Neue Begriindung II, 
he eliminated the phase-ambiguity of the probability amplitude for any two 
quantities by adding two indices indicating a specific choice of the quantities 
canonically conjugate to those two quantities (see Section 4, Eqs. (133)— (139)). 
Von Neumann's response to this same problem was very different and under- 
scores that he was not wedded at all to the canonical formalism of classical 
mechanics. Von Neumann decided to avoid probability amplitudes altogether. 
Instead he turned to projection operators in Hilbert space, which he used 
both to formulate the spectral theorem and to construct a new formula for 
conditional probabilities in quantum mechanics (see Eq. (156) and Eq. (164)). 



Although von Neumann took Jordan's formula for conditional probabilities 
as his starting point and rewrote it in terms of projection operators, his final 
formula is more general than Jordan's in that it pertains both to pure and 
to mixed states. However, it was not until the next i nstallment of h is 1927 
trilogy, Wahrscheinlichkeitstheoretischer Aufbau, that Ivon Neumann! (jl927bl ) 
carefully defined the difference between pure and mixed states. In this paper, 
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von Neumann freed his approach from reliance on Jordan's even further (see 
Section 6). He now derived his formula for conditional probabilities in terms 
of the trace of products of projection operators from the Hilbert space formal- 
ism, using a few seemingly innocuous assumptions about expectation values 
of observables of systems in an ensemble of copies of those systems charac- 
terized by a density operator. He then showed that the density operator for a 
uniform ensemble is just the projection operator onto the corresponding pure 
dispersion-free state. Such pure states can be characterized completely by the 
eigenvalues of a complete set of commuting operators. This led von Neumann 
to a new way of formulating a typical problem in quantum mechanics. Rather 
than identifying p's and g's for the system under consideration, he realized 
that it suffices to specify the values of a maximal set of commuting operators 
for the system. All operators in such sets can be thought of as q's. There is 
no need to find the p's canonically conjugate to these g's. 



Coda: Return of the p's and g's in Quantum Field Theory 



In this paper, we emphasized the difficulties engendered by Jordan's insis- 
tence on the primacy of canonical (p, q) variables in expressing the dynam- 
ics of general quantum-mechanical systems (see especially Section 4 on Neue 
Begriindung II). These difficulties became particularly acute in the case of sys- 
tems with observables with purely or partially discrete spectra, of which the 
most extreme case is perhaps the treatment of electron spin (see Eqs. (119)— 
(121)). Here, the arbitrary choice of two out of the three spin components 
to serve as a non-commuting canonically conjugate pair clearly reveals the 
artificiality of this program. 

In a sense, however, Jordan was perfectly right to insist on the importance of 
a canonical approach, even for particles with spin. In hindsight, his error was 
simply to attempt to impose this structure at the level of non-relativistic 
quantum theory, where electron spin appears as an essentially mysterious 
internal attribute that must be grafted onto the nonrelativistic kinematics 
(which does have a perfectly sensible canonical interpretation). Once electron 
spin was shown to emerge naturally at the relativistic level, and all aspects of 
the electron's dynamics were subsumed in the behavior of a local relativistic 
field, canonical L p & q 1 thinking c ould be reinstated in a p erfect ly natural way. 



This was first done explicitly by iHeisenberg and Paulil (119291 ) in their semi- 



nal paper on Lagrangian field theory. In modern notation, they introduced a 
relativistically invariant action for a spin-| field, as a spacetime integral of a 
Lagrangian: 

S= [ Cd 4 x = [ ${x){i'fd li - m)^(x)d 4 x. (188) 



Here the field ip(x) is a four-component field, with the 7 M (/i = 0, 1, 2, 3) the 4x 
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4 Dirac matrices. A conjugate momentum field ir^(x) = dC/dip is then defined 
in the standard fashion, with canonical e qual-time antz- com mutation relations 
i mpos ed — as indicated by earlier work by Jordan! ( 1927h ) and Jordan and Wigner 



( 119281 ) — in order to insert the desired fermionic statistics of the particles de- 



scribed by the field, 

{7rt(x,t),i) n (y,t)} = iM m J\x-y). (189) 

The transition from a Lagrangian to a Hamiltonian (density) is then carried 
out in the usual way 

U = Tt*$-£ = ${tf-V + m)vl>{x). (190) 

The spatial integral of this Hamiltonian energy density would within a few 
years be shown to describe exactly the free Hamiltonian for arbitrary multi- 
particle states of non-interacting electrons and positrons, degenerate in mass 
and each displaying the usual panoply of spin-^ behavior which had finally 
been deciphered, in the non-relativistic context, by the atomic spectroscopy 
of the mid to late 1920s. The essential point is that the relevant p's and 
g's appearing in the theory are not associated with the first-quantized wave- 
functions or state vectors appropriate for a non-relativistic treatment, but 
rather with the fields that must replace them once a fully relativistic theory 
takes center stage. 
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